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Abstract 

In a recent work with Kindler and Wimmer we proved an invariance principle for the slice for low- 
influence, low-degree harmonic multilinear polynomials (a polynomial in xi,...,Xn is harmonic if it 
is annihilated by Xlihi ai“)- Here we provide an alternative proof for general low-degree harmonic 
multilinear polynomials, with no constraints on the influences. We show that any real-valued harmonic 
multilinear polynomial on the slice whose degree is o{s/n) has approximately the same distribution under 
the slice and cube measures. 

Our proof is based on ideas and results from the representation theory of Sn, along with a novel 
decomposition of random increasing paths in the cube in terms of martingales and reverse martingales. 
While such decompositions have been used in the past for stationary reversible Markov chains, our 
decomposition is applied in a non-reversible non-stationary setup. We also provide simple proofs for 
some known and some new properties of harmonic functions which are crucial for the proof. 

Finally, we provide independent simple proofs for the known facts that 1) one cannot distinguish 
between the slice and the cube based on functions of o(n) coordinates and 2) Boolean symmetric functions 
on the cube cannot be approximated under the uniform measure by functions whose sum of influences is 
o{y/n). 
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1 Introduction 


The basic question motivating our work is the following: 

Question 1.1. Assume n is even. How distinguishable are the uniform measure /i on {0,1}" and the 
measure v given by the uniform measure on {0,1}" conditioned on = n/2? 

More generally: how distinguishable are the product measure pLp on {0,1}" in which each coordinate takes 
the value 1 independently with probability p and Vpn given by the uniform measure on {0,1}" conditioned on 
= pn (assuming pn is an integer)? 

Note that the two measures are easily distinguished using the simple sum-of-coordinates test. How does 
the answer change if we “do not allow” the sum-of-coordinate test? From a computational perspective 
we might be interested in restricted families of tests, such as low-depth circuits or low-degree polynomials. 
Furthermore, it turns out that the canonical representation of functions with respect to Vpn, called harmonic 
representation, does not include the function '^Xi (indeed, under Vpn it can be represented as a constant). 

We call {0,1}" the cube, the support of the distribution i/p„ the slice, and the support of v the middle 
slice. For exposition purposes, the introduction will only discuss the middle slice, though all results (previous 
and ours) extend for the case of pLp and ;/p„ for every fixed p. 

1.1 Low-degree polynomials 

In a recent joint work with Kindler and Wimmer m we provided a partial answer to Question 11.11 by 
extending the non-linear invariance principle of m- As mentioned earlier, any function on the slice has 
a canonical representation as a harmonic 0 multilinear polynomial. The harmonic property means that 
Sr=i ~ 0. Suppose that / is a harmonic low-degree low-influence multilinear polynomial. The invariance 
principle of establishes that the distribution of / under the measure v is close to its distribution under 
the product measure p. on {0,1}", as well as to its distribution under the product space R" equipped with 
the product Gaussian measure G = N(l/2,1/4)®". 

The restriction to multilinear harmonic polynomials is quite natural in the slice — as every function on 
the slice has a unique representation as a harmonic multilinear polynomial (this fact, due to Dunkl [13) . is 
proved in Section It is the analog of the implicit restriction to multilinear polynomials in the original 
non-linear invariance principle. Further motivations, from commutative algebra and representation theory, 
are described in Section [TOl 

Both the invariance principle proven in [27] and the one proven in HZ] require that the functions have low 
influences. Indeed, a function like xi has a rather different distribution under p, compared to G- Similarly, 
the function Xi — X 2 has a rather different distribution under v compared to G- However, note that the 
distribution of Xi — X 2 under v is quite similar to its distribution under p. It is natural to speculate that 
low-degree harmonic functions have similar distributions under v and p. Unfortunately, the proof of the 
invariance principle in goes through Gaussian space, rendering the low-influence condition necessary 
even when comparing v and p. 

Our main result in this paper is a direct proof of the invariance principle on the slice, showing that the 
distribution of a low-degree harmonic function on the slice is close to its distribution on the corresponding 
cube. Our results do not require the condition of low influences. 

Theorem 1.2. Let f: { — 1,1}" —>■ R 6e a harmonic multilinear polynomial of degree o(-\/n) and variance 1. 
For any 1-Lipschitz function (p (i.e., one satisfying \p{x) — (p{y)\ <\x — y\), 

|E[^(/)]-E[:^(/)]|=o(l), 

U fl 

and the Levy distanc^ between the distribution of f under fi and its distribution under v is o(l). 

^This somewhat unfortunate terminology is borrowed from Bergeron [JJ Section 8.4], in which an S'n-harmonic polynomial 
is one which is annihilated by multilinear polynomials, both definitions coincide. 

^The Levy distance between two real random variables X, Y is the infimum value of e such that for all t E M it holds that 
Pr[X <t-€]-e< Py[Y < t] < Pr[X < t + e] + e. 
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See Theorem as well as Corollary 14.71 for more quantitative bounds and more general statements 
(which apply in particular to any i.i.d. measure on the cube and the corresponding slice). In Subsection 14.21 
we show that the results cannot be extended to polynomials whose degree is much bigger than ^/ri. 

While Theorem o cannot hold for arbitrary multilinear polynomials, we are able to recover a similar 
theorem for such polynomials by considering the distribution of / on several coupled slices at once; see 
Theorem 16 .1 71 and its corollaries. 

Between representation theory and probability An interesting aspect of our work is the interplay 
between the L 2 theory of the slice and probabilistic arguments based on coupling. The L 2 theory of the 
slice, which was developed by Dunkl [T31|T3], is intimately related to representations of Sn- In Section [3] 
we provide an elementary and self-contained approach to this theory. The L 2 theory is used to control the 
norm of low-degree harmonic functions with respect to several different measures, in particular by using a 
two-sided Poincare inequality. This, in turn, allows us to introduce the main probabilistic argument. 

Novel proof ingredients The basic idea of the proof is to use the coupling method by showing that the 
distribution of / on different slices of the cube is almost identical, as long as the slices are of distance at 
most roughly y/ri (here the distance between the fcth slice and the £th slice is \k — £\). In fact, for our proof 
to work we crucially need to allow distances which are somewhat larger than ^/n. 

To construct the coupling, we use a uniform random increasing path to couple level £ to level k above or 
below it. The main novel technique is representing the difference between the two levels as a difference of 
two martingales. Such representations have been used before in the analysis of stationary reversible Markov 
chains in Banach spaces [28], and even earlier in the analysis of stochastic integrals [26]. However, all previous 
decompositions were for stationary reversible chains, while ours is neither. Our novel representation of the 
differences might be of interest in other applications. We outline this argument in the introduction to 
Section m and carry it out in the rest of the section. 

Applications Except for the natural interpretation of Theorem 11.21 in terms of distinguishing between 
distributions, it can be used to prove results in extremal combinatorics in the same way the main result 
of [17] is used. For example, in Proposition 15.51 we give a proof of the Kindler-Safra theorem on the slice, 
first proved in m- 

1.2 Influences, symmetric functions and circuits 

We prove a few other results that give partial answers to Question ll.il 

• Using direct computation of the total variation distance we prove the following theorem: 

Theorem 17.21 Let / be a function on {0, 1}" depending on o{n) coordinates and satisfying ||/||oo < 1- 
Then 

|E[/]-E[/]|=o(l). 

V fl 

• We prove that symmetric functions cannot be approximated by functions whose total influence is o{y/n) 
(see Proposition 18.51 for a more general formulation): 

Proposition 1.3. There exists a eonstant (5 > 0 such that if f is a symmetric Boolean function such 
that I < E^p[/] < I then Pr^^]/ ^ g] > 6 for every Boolean function g satisfying Inf[gr] = o{^/n). 

Since it is well-known [7], based on arguments from ESHH], that a depth d size m circuit has total 
influence at most 0((logm)'^“^), our result immediately implies circuit lower bounds for such functions. 
However, much better bounds are known, see e.g. [37] for general symmetric functions and [30] for the 
case of the majority function. Nevertheless, Proposition 11.31 is more general as it holds for functions 
/ that are not necessarily the majority function and for functions g that are not necessarily in AC*^. 
Moreover, the proof of Proposition 1 1. 31 is based on a new and simple probabilistic argument. 


4 




1.3 Other results comparing the cube to the slice 

Question 11.11 is not a new question. So we conclude the introduction with a few classical results related to 
this question: 

• The limiting behaviors of the partial sum W{s) = — ^) as s —>■ oo under the cube and 

the slice measures are well-studied. It is well-known that under the cube measure W{s) converges to 
Brownian motion, while under the slice measure it converges to a Brownian bridge. 

• It is well-known that the partial sums W (s) are at least as concentrated in the slice as they are in the 
cube [19]. 

• It is well-known that Lipschitz functions of the random variables cci,... are concentrated both in 
the cube and in the slice. The results for the slice follow from the hypercontractive estimates by Lee 
and Yau [21|. These are also needed in our proofs. 

Paper organization A few useful definitions appear in Section |2| Harmonic functions are defined and 
analyzed in Section [Sj We prove our invariance principle for Lipschitz functions in Section |4| An invariance 
principle for the function ^p{x) = (|a;| — 1)^ appears in Section jS] and is illustrated by a proof of a Kindler- 
Safra theorem for the slice, hrst proved in m- Generalizations to non-harmonic functions are discussed in 
SectionjGj An invariance principle for bounded functions depending on o(n) coordinates is proved in Section|3 
and an invariance principle for Boolean functions whose total influence is o{^/n) is proved in Section |5| We 
explore decompositions of the space of fucntions over the slice in Section jO] pointing out connections to 
several other concepts: associations schemes, regular semilattices, spherical posets, and differential posets. 
Finally, we justify the use of harmonic functions in Section [TUI 
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2 Definitions 

Notation We employ the falling power notation n- = n{n — 1) • • • (^ ~ ^ + !)• The notation 1e equals 1 
if the condition E holds, and 0 otherwise. The sign function is denoted sgn. The L 2 triangle inequality is 
(a -I- 6)^ < 2(a -I- 5)^ or its generalization (X]r=i ^ ■ 

A monomial is squarefree if it is not divisible by a square of a variable. (Thus there are 2" squarefree 
monomials on n variables.) A polynomial is multilinear if all monomials are squarefree. A polynomial 
is homogeneous if all monomials have the same total degree. The dth homogeneous part of a polynomial 
/ = ^ Cmm, denote f^‘^, is the sum of Cmm over all monomials m of total degree d. A polynomial / over 
Xi,..., is harmonic if ^ = 0. 

A univariate function / is C-Lipschitz if \f{x) — /(y)| < C\x — y\. A function is Lipschitz if it is 
1-Lipschitz. 

The expectation and variance of a random variable are denoted E, V, and || • || denotes its L 2 norm 
||A|| = y^EjA^j. To signify that expectation is taken with respect to a distribution a, we write Eq[A], 
Vajcc], and || • ||a. A normal distribution with mean /r and variance is denoted N(/i,cr^). A binomial 
distribution with n trials and success probability p is denoted B(n,p). 
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The symmetric group on [n] = {1,... ,n} is denoted S'„. A distribution on R" is exchangeable if it is 
invariant under the action of Sn (that is, under permutation of the coordinates); a discrete distribution is 
exchangeable if the probability of (xi,..., Xn) depends only on xi + • • • + Xn- For a function / on R." and 
a permutation tt, we define , Xn) = /(a^ 7 i-(i)i ■ ■ • i ^Trin))- We compose permutations left-to-right, so 

that (/“)^ = 

The slice The n-dimensional Boolean cube is the set {0,1}". For an integer 0 < k < n, the fcth slice of 
the n-dimensional Boolean cube is the set 



Probability measures Our work involves two main probability measures, where n is always understood: 


• The uniform measure on the slice is i^k- 

• The product measure on the Boolean cube is given by gip{x) = ^‘(1 — 


Note that Vk,Pk/n have the same marginal distributions. 

3 Harmonic functions 

A basic and easy result states that every function on {—1,1}” has a unique representation as a multilinear 
polynomial, known as the Fourier expansion. It is easy to see that a multilinear polynomial has the same 
mean and variance with respect to the uniform measure on { — 1,1}” and with respect to the standard n- 
dimensional Gaussian measure. In this section we describe the corresponding canonical representation on 
the slice, due to Dunkl nmi] and elaborated by Srinivasan |34| . Filmus m and Ambainis et al. yy . IVIost 
of the results in this section are already known, though the proofs presented in this section are novel. A 
possible exception is the two-sided Poincare inequality for derivatives, Lemma l3.15l 

The canonical representation of functions on the slice is described in Subsection 13.11 We decompose this 
representation into orthogonal parts in Subsection 13.21 where we also deduce that the mean and variance of 
a low-degree function is similar on the slice and on the Boolean cube. The analog of the Poincare inequality 
is proved in Subsection 13.31 alongside results of a similar flavor. Finally, we prove that degree is subadditive 
with respect to multiplication, and monotone with respect to substitution, in Subsection 13.41 

3.1 Canonical representation 

Every function on the slice can be represented as a multilinear polynomial, but this representation is 
not unique. However, as found by Dunkl nani], we can make it unique by demanding that it be harmonic 
in the sense of the following definition. 

Definition 3.1. A polynomial P over xi,... ,Xn is harmonic if 



In other words, P is harmonic if AP = 0, where A is the differential operator ai“- 

Definition 3.2. A basic function is a (possibly empty) product of factors Xi — Xj on disjoint indices. A 
function is elementary if it is a linear combination of basic functions. 

Most, but not all, of the harmonic polynomials we consider will be multilinear. In particular, notice that 
all elementary functions are multilinear. Here are some basic properties of harmonic polynomials. 
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Lemma 3.3. The set of harmonic polynomials is an algebra of polynomials, and is closed under partial 
derivatives, under permutations of the coordinates, and under taking homogeneous parts. In particular, all 
elementary functions are harmonic. 

Proof. Suppose f,g are harmonic. Then A{af + fig) = aAf + flAg = 0; A{fg) = fAg + gAf = 0; 

= 0- Finally, since A(X;Lo/^‘^) = and A/=‘^ is 

homogeneous of degree d — 1, we see that Af^‘^ = =0. □ 

Lemma 3.4. A polynomial f is harmonic if and only if for all Xi,. .. ,a;„,c we have 

f{xi +C,...,Xn+c)= f{xi, ..., Xn)- 

Proof. Given xi,..., a:„, define a function 

(/)(xi, . . . ,X„,c) = /(xi +C,...,Xn+c). 

The chain rule implies that ^ = A/. Hence A/ = 0 iff </> is independent of c. □ 

Our first theorem states that every function on the slice has a unique representation as a harmonic 
multilinear polynomial of degree at most min(A:, n — k). 

Theorem 3.5. Let 0 < k < n. Every function on the slice has a unigue representation as a harmonic 
multilinear polynomial of degree at most min(A:,n — k). 

Proof. We use the notation f = g (read / agrees with g) to denote that / agrees with g on the slice 

We start by proving, by induction on min(fc,n — k), that every function on a slice has some repre¬ 
sentation as a harmonic multilinear polynomial. The base cases, fc = 0 and k = n, are trivial, since in these 
cases all functions are constant. 

Consider now a function / on the slice where 0 < k < n. Define a function g on the slice 

by g{xi ,..., Xn- 2 ) = f{xi ,..., x„_ 2 ,0, 1). By the induction hypothesis, g can be represented as a harmonic 
multilinear polynomial G = g of degree at most min(fc — 1, (n — 2) — (A: — 1)) = min(A:,n — k) — 1. Let 
H = (x„ — Xn-i)G. Note that id is a multilinear polynomial of degree at most min(A:,n — k), and by 
Lemma [3.31 it is harmonic. We claim that / — = H (recall that ") is obtained from / by 

permuting coordinates n — 1 and n). Indeed, when x„_i = x„, both sides vanish; when (x„_i,x„) = (0,1), 
this is true by definition; and when (x„_i,Xra) = (1,0), this is true since = —id, and / — 

satisfies the same identity. Thus / — can be represented as a harmonic multilinear polynomial of 

degree at most min(d,n — k). For short, we say that / — is representable. 

The same argument implies that / — /Of) is representable for all i,j. Applying the same argument to 
/(fe 1-) ^ see that 3 ) jg representable, and so / — 3 ) jg representable. Continuing in this 

way, we see that f — f^ is representable for all tt S S'„. Averaging over all tt S we see that / — is 

representable. Notice that E,r[/’^] is simply the constant function E[/], which is clearly representable. We 
conclude that / itself is representable. 

It remains to prove that the representation is unique. To that end, it is enough to show that if P is a 
harmonic multilinear polynomial of degree at most min(fc, n — k) such that P = 0 then P — 0. We prove 
this by induction on min(fc,n — k). The base cases, k = 0 and k = n, are trivial, since if P is a constant 
polynomial agreeing with the zero function, then P = 0. 

Suppose now that P = 0 on the slice (^^^) for some harmonic multilinear polynomial P of degree at most 
min(d,n —d), where 0 < k < n. Write P = A-|-Xti_iP-I- x„C-|-x„_iXnP, where A,B,C,D are polynomials 
over Xi,... ,x„_ 2 , and notice that P — pl"”^ = {xn-i — Xn){B — C). Since A[(x„_i — Xn){B — C)] = 

(x„_i — Xn)A{B — C), we see that B — C is harmonic. Considering the substitution (x„_i,x„) = (0,1), we 
see that P — C is a harmonic multilinear polynomial of degree at most min(d, n — d) — 1 which agrees with 
0 on the slice induction hypothesis implies that B — C = 0, and so P — p(”“i "Z 

The same argument implies that P = p(®f) for all i,j, and so P = P^ for all tt. Hence P is a symmetric 
polynomial, and so it is a function of xi • --t-x^, say P = ^'(a^i + • • ■ + Xn) for some polynomial (f. Lemma[T4] 
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implies that (/) is constant, and so P is constant. Since P agrees with 0, we must have P = 0, completing 
the proof. □ 


Similarly, we can prove that every harmonic multilinear polynomial is elementary. 

Lemma 3.6. A multilinear polynomial is harmonic iff it is elementary. In particular, a harmonic multilinear 
polynomial over Xi,... ,Xn has degree at most n/2. 

Proof. Lemma 13.31 implies that every elementary polynomial is harmonic. We now prove that every mul¬ 
tilinear harmonic polynomial is elementary by induction on n. If n = 0 then any polynomial is constant 
and so elementary. If n = 1 then any harmonic polynomial is constant and so elementary. Consider 
now any n > 2 and any harmonic multilinear polynomial / over Xi,. .. ,Xn- For any i j we can write 
f = A + XiB XjC + XiXjD, where A,B,C,D don’t involve Xi,Xj, so that / — = (a;^ — Xj){B — C). 

Notice that / — is harmonic, and since A[(a;i — Xj){B — C)] = {xi — Xj)A(B — C), we see that B — C 
is also harmonic. By induction, we get that B — C is elementary, and so / — = {xi — Xj){B — C) is 

elementary. 

Applying this to , we get that i) jg elementary, and so is / — i), Continuing in 

this fashion, / — is elementary for all ir G Sn, and so / — is elementary. The polynomial is 

symmetric and so [/'"’] = -I- • • • -I- Xn) for some polynomial (j). Lemma 13.41 implies that (j) is constant, 

and so E^ [f^] is constant as well, and so elementary. We conclude that / is elementary. □ 

The algebraically inclined reader may enjoy the following reformulation of Theorem l3.51 

Corollary 3.7. Let 0 < k < n, and fix a field F of characteristic zero. Consider the polynomial ideal 



where we think of the slice as an affine variety. Then ¥[xi,... ,Xn]/1 is isomorphic to the ring of 
harmonic multilinear polynomials of degree at most min(fc, n — k) over xi,... ,Xn- 

We discuss this reformulation in Section 110.11 giving an alternative proof of the theorem. 

We conclude this section by computing the dimension of the space of harmonic multilinear polynomials 
of given degree. 

Corollary 3.8. Let d < n/2. Then 

(a) The linear space H<d of harmonic multilinear polynomials of degree at most d has dimension (^). 

(b) The linear space Hd of harmonic multilinear polynomials which are homogeneous of degree d has dimen- 

(d) - id-i)’ ( "J = 0. 

Proof. The first item follows directly from Theorem 13.51 taking k = d. Since dim H<d = '^g^odimHe, the 
second item follows from the first. □ 

For another proof, see the proof of Theorem 110.11 

3.2 Orthogonality of homogeneous parts 

As stated in the introduction to this section, multilinear polynomials enjoy the useful property of having the 
same mean and variance with respect to all product measures with fixed marginal mean and variance. The 
corresponding property for harmonic multilinear polynomials is stated in the following theorem, which also 
follows from the work of the first author m- A representation-theoretic proof of the theorem is outlined in 
Section 19.11 





Theorem 3.9. Let f,g be homogeneous harmonic multilinear polynomials of degree df,dg, respectively, and 
let a he an exchangeable measure. If df dg then ¥,a[fg] = 0. If dj = dg = d then there exists a constant 
Cf^g independent of a such that 

E[fg] = Cf^g E[(a;i - X 2 f ■ ■ ■ {x 2 d-i - X 2 d)'^]- 

Oi Oi 

Proof. Let h = fg, and note that Lemma 13.31 implies that h is harmonic. Let H = and note that 

H is also harmonic. Since a is exchangeable, E^lfg] = EQ,[iL]. Our goal is to show that ^a[H] = P{f,g)I'd,a, 
where „ does not depend on / or g. We first note that iL is a linear combination of the functions 

bt = E • • • a;.,r(d/+dg-t)) 

TTt On 


say H = J2t^tbt. 

When a = N(^, cr^), we have 


min{df ,dg) 

Since H is harmonic, Lemmaimplies that En(^ doesn’t depend on p,, and so plugging in the value 

^ = 0 we see that 

min(dy ,dg) 

t=0 

Consider first the case where df Y dg. In this case when we plug /i = 0 in the equation above all of 
the terms on the left-hand side vanish and therefore the right-hand size vanishes as well. Note that the 
polynomials (cr^ -I- iin 0 aj.iy independent as formal polynomials in pi, a. Since their linear 

combination vanishes it follows that /3t = 0 for all t, and so Ea[fg] = Eo,[iL] = 0 for all exchangeable a. 
Next, consider the case where df = dg = d. We now get 

t^O 

Note that the polynomials (cr^ -I- are linearly independent. This implies that up to multiplication 

by a scalar, there exists exactly one vector (3 that satisfies the equation above. In other words, there exists 
a vector of coefhcients 7 = ( 70 ,..., 7 d) with 7 d = 1 such that (3 = PdY and such that 7 does not depend on 
f,g or H. We therefore conclude that Ea[fg] = Ea[iL] = Pd{f,g)I'd,a for some parameter F^.a depending 
on d, a. 

Choosing f = g = {xi — X 2 ) ■ ■ ■ {x 2 d-i — X 2 d), we find that 

E \H] = 2'^cr2d 


and so Pd{f,g) = 2'’*. This implies that 

Fd.a = = 2"‘’'E[(a;i - X 2 Y ’ ’ ’ (a^2d-i - X2dY], 

Pd(f,g) 


and so we conclude that 


mg] = 2 ‘^(3d{f,g)mxi - X2Y---{x2d-l - X2df\- 

ot a. 


□ 
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Corollary 3.10. Let f be a harmonic multilinear polynomial of degree at most d with constant coefficient 
Suppose that a, ft are exchangeable measures and C > 0 is a constant that for t < d satisfies 

E[(a:i - X2)'^ ■ ■ ■ {X2t-1 - X2t)'^] < C'E[(xi - X2)'^ ■ ■ ■ {X2t-1 - X2tf]. 

a /3 

Then E4f] = /=o, ||/||2 < C\\f\\% and V„[/] < CYp[f]. 

Proof. Write / = where /=* is homogeneous of degree t. Theorem 13.91 implies that are 

orthogonal with respect to all exchangeable measures. This implies that Ea[/] = and 

ll/ll^ = Ell,r‘llL 

The theorem also implies that for some , Kd we moreover have 

d d 

WfWi = '^KtE[{xi - X2f ■■■ (X2t-1 - X2t)'^] < C'^KtE[{xi - X2f---{x2t-l - X2tf] = CWfWl- 

t^o ^ t^o ^ 

Finally, since / — is also harmonic, we deduce that Y[f]a < CY[f]p. □ 

The following lemma computes E[(a;i — X 2 y ■ ■ ■ {x 2 d-i — X 2 d)'^] for the measures lyk, iJ-p- 
Lemma 3.11. Let p = k/n. We have 


EJ(X1 - X2)^ • • • ix2d-l - X2dr] = = ( 2 p(l - p)r (l ± O ’ 

E [(xi - X2)^ • • • ix2d-l - X2d)'^] = ( 2 p(l - p)^. 

fip 

This straightforward computation appears in m Theorem 4.1] and m Lemma 2.9]. Qualitatively, the 
lemma states that the norm of a low-degree basic function is similar in both Vk and Pp. This is not surprising: 
the coordinates in the slice are almost independent, and a low-degree basic function depends only on a small 
number of them. 


3.3 Poincare inequalities 

We proceed by proving the so-called two-sided Poincare inequality, starting with the following fact. 
Lemma 3.12. Let f be a harmonic multilinear polynomial. Then 





d{n — d -|- 1) 


c—d 


where f ^ is the dth homogeneous part of f. 

Proof. In view of Lemma 13.61 / is elementary. Therefore it is enough to consider the case that / is a basic 
function of some degree d, say / = nt=i(^“t ~ Xbt)- We split the transpositions into four kinds. The first 
kind is transpositions which do not involve any at,bt. There are such transpositions (z j), and they 

all satisfy /bt) = f. The second kind is transpositions of the form {at bt). There are d of these, and they 
satisfy The third kind is transpositions of the form {at j) or {bt j), where j ^ as,&s- There 

are d{n — 2d) pairs of these. Since 

{Xat - Xbt ) + {Xat - Xbt ) = Xat “ Xbt > 
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each such pair contributes one multiple of /. The fourth kind is transpositions involving two pairs (at bt), (as bg), 
which we group in the obvious way into ( 2 ) quadruples. Direct computation shows that 

X! - Xtt){Xa, - Xbjr = ‘^{Xat “ Xbt){Xa, “ XbJ, 

7rG{(at Os),(at as),{bt ba)} 


and so the contribution of each such quadruple is two multiples of /. In total, we obtain 


i<j 


n — 2d\ I'd 

2 j - d +d{n-2d) + 2i 


f = 


— d{n — d+1) 


/• 


□ 


Lemma 3.13. Let f be a harmonic multilinear polynomial of degree at most d. Then with respect to any 
exchangeable measure, 

nV[/] < i ^ 11/ - < d{n -d+l)Y[f]. 

i<j 

Proof. Write / = Theorem 13.91 implies that the homogeneous parts are orthogonal. We have 

i E11/ - II' = ( 2 ) ll/ll' - (/’ E 

i<j ^ '' i<j 

Lemma [3.121 implies that 

^Eii/-/^“^^ii' = E^(^-^+i)ii/=‘ii'- 

i<j t—0 

The lemma now follows from the observation that for 1 < t < d we have n < t{n — t + 1) < d{n — d + 1), 
since t{n — f + 1) is increasing for t < {n + l)/2. □ 


Finally, we prove another two-sided Poincare inequality, this time for derivatives. We start with the 
following surprising corollary of Theorem 13.91 

Lemma 3.14. Let f,g be homogeneous harmonic multilinear polynomials of degree d. Then for any ex¬ 
changeable measure a, 




9f dg 

dxi dxi 


]Ea[/5] 


^ = 2d 


Ea[(xi - X2)‘^ ■ ■ • ix2d-3 - X2d-2)‘^] 
Ea[{xi - X2)^ ■ ■ ■ ix2d-l - X2d)‘^] 


Proof. Since the functions ^ are harmonic. Theorem 13.91 implies that there are constants Cf^g,Df^g 
such that 

Er=iEo 


df dg 
dxi dxi 


Df^g E„[(a;i - X 2 )'^ ■ ■ ■ {x2d-3 - X 2 d- 2 )‘^] 


Eq[/ 5 ] C'/.S Ea[{xi - X2)'^ ■ ■ ■ ix2d-l - X2d)‘^] 

Here D^g = X:r=i C. df dg ■ Wg Ccin GvalucitG tliG ratio Djg/Cjg by considGring thG distribution ex — 

dxj^ ’ dxj^ 

N(0,J„): 


LLi En(o,/„) 


df dg 
dxi dxi 




D 


f :9 


EN( 0 ,/„)[/d] 

On the other hand, with respect to N(0, /„) we have 


^f,g 2"^ 20/^5 


y]E 

2=1 


df dg 
dxi dxi 


= EE /(‘^)5(5) = E \S\f{S)m = dE[fg]. 

i=l S 


We conclude that Df_g/{2Cf^g) = d, and so Df,g/Cf^g = 2d. 


□ 
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We deduce the following two-sided Poincare inequality. 


Lemma 3.15. Let f be a harmonic multilinear polynomial of degree d, and let a be an exchangeable measure. 
Suppose that for 1 < t < d we have 


E„[(a;i - X2)^ ■ ■ ■ ix2t-i - X2tY] 


Then also 


i=l 


df 


dxi 


<MY[f]. 


Proof. Write / = J2t=o notice that ^ = J2t=i latter sum being a decomposition into 

homogeneous parts. Theorem 13.91 implies that the homogeneous parts are orthogonal with respect to a, and 
so 


n It “Hi: 


dxi 


t=l i=l 


df- 


dxi 


Using Lemma |3.14l we can upper bound 


EE 


dr 


dxi 


<M^||/=‘f =MV[/]. 


t=l i=\ 

The lower bound is obtained in the same way. 

The following lemma computes m,M for the measures VkjfJ-p. 

Lemma 3.16. Let p = k/n. We have 

[(a^i - 3 ^ 2 )^ • ■ ■ ix 2 d -3 - X 2 d- 2 )‘^] _ ^_{n -2d + 2)(n -2dp\) 


[(a;i - X 2 Y ■ ■ ■ {x 2 d-i - X 2 d)^] {k- d + l){n - k - d + 1) 


i±o 


2d 


pil-p) 

~ 3 ^ 2 )^ • ■ ■ {x2d-3 - X2d-2)‘^] _ d 
E^p [(a;i - X 2 Y ■ ■ ■ {X 2 d -1 - X 2 d)^] p(l - p)' 


p{1-p)t 


□ 


The proof is a straightforward corollary of Lemma 13.111 


3.4 Cometric property 

Theorem 13.51 states that every function on the slice can be represented as a harmonic multilinear 
polynomial. The following result shows that if the original function can be represented as a polynomial of 
degree d, then its harmonic representation has degree at most d. 

Lemma 3.17. Let 0 < k < n. Let f be a polynomial of degree d on the variables Xi,... ,Xn. The unique 
harmonic multilinear polynomial agreeing with f on the slice has degree at most d. 

Proof. We can assume without loss of generality that / is multilinear, since on the slice a:| = Xi. Hence 
it suffices to prove the theorem for the monomial f = Xi - ■ ■ Xd. Theorem 13.51 states that there is a unique 
harmonic polynomial g of degree at most min(fc, n — k) agreeing with / on the slice . If d > min(fc, n — k) 
then the lemma is trivial, so assume that d < min(fc, n — k). 

Decompose g into its homogeneous parts: g = -I- • • • -b ^ show that if e satisfies 

d < e < min(fc, n—k) and /i is a basic function of degree e then (/, h) = 0, where the inner product is with 
respect to the uniform measure on the slice. Since / agrees with g on the slice, it follows that {g, h) — 0. 
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Lemma 13.61 shows that is a linear combination of basic functions of degree e (since basic functions are 
homogeneous), and so ( 5 , 5 ^®) = 0. Theorem 13.91 implies that = 0. Since this is the case for all e > d, 
we conclude that deg 5 < d. 

Consider therefore a basic function h = (xa^ — Xb^) ■ ■ ■ (xa^ — Xb^)- We have 

{Xa,-Xb,)---{Xa,-XbJ. 

Xi = --- = Xd = l 

Since e > d, for some i it holds that ai,bi > d. The only non-zero terms in the sum (if any) are those for 
which Xai Xbi- We can match each term in which Xa^ — Xb^ = 1 with a term in which — Xbi = — 1, 
obtained by switching and Xf,^. The two terms have opposite signs and so cancel. This shows that the 
entire sum vanishes, completing the proof. □ 

As a corollary, we deduce the so-called Q-polynomial or cometric property of the slice. For a function / 
on a slice, denote by deg / the degree of its unique harmonic multilinear representation on the slice. 

Corollary 3.18. Let f,g be functions on a slice. Then deg{fg) < deg / -I- degg. 

Similarly, we have the following substitution property. 

Corollary 3.19. Let f be a function on a slice, and let g be the function on a smaller slice obtained by 
fixing one of the coordinates. Then degg < deg/. 

Both results are non-trivial since the product of two harmonic multilinear polynomials need not be 
multilinear, and substitution doesn’t preserve harmonicity. 


4 Invariance principle 

In this section we prove an invariance principle showing that the distribution of a low-degree harmonic 
function on a slice is similar to its distribution on the Boolean cube with respect to the measure pk/n- 
For convenience, we analyze the similarity in distribution via Lipschitz test functions, and derive similarity 
in more conventional terms as a corollary. The basic idea is to show that the distribution of a low degree 
function on a given slice (^^^) is similar to its distribution on nearby slices (^"^). If we can show this for all 
slices satisfying \k — i\ < B for some B = u!{-s/n), then the invariance follows by decomposing the Boolean 
cube into a union of slices. 

The argument Before giving the formal proof, we provide the intuition underlying the argument. Our 
argument concerns the following objects: 

• A harmonic multilinear polynomial / of degree d and unit norm. We think of d as “small”. 

• A Lipschitz function ip. 

• A slice (p^). We think of p as constant, though the argument even works for sub-constant p. 

Our goal is to show that [Tif)] ~ [rif)]- The first step is to express pp as a mixture of for various 

£: 

E b(/)] = E W/a - P)”-" EN/)]- 

Applying the triangle inequality, this shows that 

iE[^(/)]- ¥,[ipm<j2{%\^-pT-'\nTU)]- EH/)]|. 

Mp i^pn ^ \lJ ye. ypn 


13 


In general we expect | E^f[ip{f)] — ]Ei,p„[(/3(/)]| to grow with |£ —pn|, and our strategy is to consider separately 
slices close to pn, say \pn — £| <6, and slices far away from pn, say \pn — ^\ >6. We will bound the 
contribution of slices close to pn directly. If <5 is large enough then we expect the contribution of slices far 
away from pn to be small, essentially since Pp is concentrated on slices close to pn. For this argument to 
work, we need to choose S so that S = uj{^/n). 

It remains to bound [(/?(/)] — Ejyp„ [</?(/)]! for i close to pn. One strategy to obtain such a bound 
is to bound instead | E,yJ(p(/)] — [(p(s)]| for various s, and use the triangle inequality. To this end, it 

is natural to consider the following coupling; let (X(s),X(s + 1)) £ x (J"\) be chosen uniformly at 
random under the constraint X(s) C X(s + 1). We can then bound 


|E[;^(/)]- E [;^(s)]| = |E[;^(/(X(s)))-^(/(X(s + l)))]|< 

I's l^B + 1 

E[|;^(/(X(s))) - (^(/(X(s + 1)))|] < E[|/(X(s)) - /(X(s + 1))|]. 


Denoting 7r(s + 1) = X(s + 1) \ X(s) and using the multilinearity of /, this shows that 


E[^(/)]- E Ks)]| <E 

I's + l 


df 


dx^ 


(s+l) 


-(X(s)) 


= E 


— E 


2^X(s) 


dx. 


(X(s)) 


While we cannot bound |^| directly. Lemma 13.151 implies that = 0{d). Applying Cauchy- 

Schwartz, we get that for s close to pn, 


Us 


E Ns)]|<^E 


< 


0(n) 


E 


E 

y/n 


K 

dxi 


:x(s)) 






= o 



Recall now that our original goal was to bound \ E,^^[(f{f)] — E,^^^[(p{f)]\ for \£ — pn\ < S, and our intended 
<5 satisfied 6 = uj{y/n). Unfortunately, the idea just described only gives a bound of the form | Ei,J(/5(/)] — 
Ei/p„ [</5(/)]I = 0 {6y/d/n), which is useless for our intended (5. 

One way out is to take 6 = C\/n. This allows us to obtain meaningful bounds both on the contribution 
of slices close to pn and on the contribution of slices far away from pn. Although this only gives a constant 
upper bound on |E^p[(/3(/)] — Ep,^„ [(/3(/)]| if applied directly, this idea can be used in conjunction with the 
invariance principle for the Boolean cube m to give an invariance principle for the slice, and this is the 
route chosen in the prequel m- One drawback of this approach is that the invariance principle for the 
Boolean cube requires all influences to be small. 

Our approach, in contrast, considers a coupling (X(0),..., X(n)) of all slices. Analogous to /(X(s + 
1)) — /(X(s)), we consider the quantity 


C(s) = {n- s){f{X{s + 1)) - /(X(s))) - s(/(X(s - 1)) - /(X(s))). 

As before, we can bound E[|C(s)|] = 0{V^)- Moreover, 

t 

Ciu) = {n- t)fiX{t + 1)) + {t- l)/(X(t)) -{n-s- l)/(X(s)) - s/(X(s - 1)), 

U—S 


and so we can bound | E^,^ [p{f )] — E,,^^ [fif)] I by bounding the expectation of Yfu=pn ^('“) C(u). 

The triangle inequality gives | J2u=s C!(m)| = 0 (|s — t\Vdn), which suffers from the same problem that we 
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encountered above. However, by expressing C(s) as a difference of two martingales, we are able to improve 
on the triangle inequality, showing that 


a bound which is useful for |s — t| 
In more detail, we define 




= 0(a/|s - t\dn), 


o{n/d) rather than for |s — t| = o{y/n/d) as before. 


U(u) = f{X{u + 1)) - f{X{u)) - E[f{X{u + 1)) - f{Xiu))\X{u)], 

D(u) = f{X{u - 1)) - /(X(u)) - E[/(X(u - 1)) - /(X(w))|X(u)], 

both martingales by construction, U('u) for increasing u, and D(m) for decreasing u. We claim that C(m) = 
(n — m)U('u) — ■uD(m). If this holds, then using the fact that E[U(u)U(u)] = E[D(m)D(u)] = 0 for u ^ V and 
the L 2 triangle inequality (a + b)^ < 2a^ + 26^, we get 


E 



< 2E 


w)U(m) 


2E 


XwD(m) 


21 


t t 

= 2X(«-m)^E[U(u)2] +2Xw^E[D(m)^]. 

U—S U—S 


This shows that C(u))^] scales linearly in t — s rather than quadratically in t — s, which is what 

we would get if we just applied the triangle inequality. Since the Li norm is bounded by the L 2 norm, we 
conclude that E[| X]L=s = 0(\/|s — t|c?n). 

Finally, let us explain why C{u) = (n — M)U(t6) — uD(m). In view of our previous expression for C(m), 
this boils down to proving that 


(n - u)E[f{X{u + I)) - f{X{u))\Xiu)] - uE[f{X{u - 1)) - f{X{u))\X{u)] = 0. 


We can rewrite the left-hand side as 


E 


X [/(X(u)UW)-/(X(u))]- X [/(X(«)\W)-/(X(«))] 

i^x(«) iex(M) 


Since / is multilinear, we can replace the differences with derivatives: 


E 






df 


zGX(n) 


dxi 


:E 




2=1 


However, the last expression clearly vanishes, since / is harmonic. This completes the outline of the proof. 


4.1 The proof 

The basic setup of our argument is described in the following definition. 


Definition 4 . 1 . We are given a harmonic multilinear polynomial / of degree d > I. 

Let TT € S'n be chosen uniformly at random, and define random variables X(s) G (^"1) for 0 < s < n as 
follows: 


1 iff G 7r({I,...,s}), 

0 if i G 7r({s -I- I,..., n}). 
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For 0 < s < n, define random variables U(s) (for s ^ n), D(s) (for s 0) and C(s) by 
U(s) = /(X(s + 1)) - /(X(s)) - E[/(X(s + 1)) - /(X(s))|X(s)], 

D(s) = /(X(s - 1)) - /(X(s)) - E[/(X(s - 1)) - f{X{s))\X{s)], 

C(s) = (n-s)U(s) -sD(s). 

In words, X(0),..., X(n) is a random maximal chain in the Boolean cube. These random variables form 
a coupling of the uniform distributions on all slices. The random variable U(s) measures the effect of moving 
up from X(s) to X(s + 1) on /, normalized so that it has zero mean given X(s). The random variable D(s) 
similarly measures the effect of moving down from X(s) to X(s — 1). Finally, C(s) measures the effect 
of moving away from X(s). The usefulness of this representation stems from the fact that U(s) and D(s) 
are martingale differences and are therefore orthogonal while C(s) is on one hand easily expressed in terms 
of D(s) and U(s) and on the other hand is useful for bounding differences of /. This representation of a 
random directed path in terms of martingales and reverse martingales is inspired by previous work using 
such representations for stationary reversible Markov chains and for stochastic integrals [26l [28] . Part of the 
novelty of our proof is using such a representation in a non-stationary setup. In particular, the formula for 
the representation is significantly different from previous applications of the method. The basic properties 
of the representation are stated formally in the following lemma. 

Lemma 4.2. The following properties hold: 

(a) Ifs^t then E[U(s)U(t)] = E[D(s)D(t)] = 0. 


(b) For all s we can bound 


E[U(s) 2] < 4E[(/(X(s + 1)) - /(X(s)))2] < -^ 

n — Q f ^ 


2=1 


df 


E[D(s) 2] < 4E[(/(X(s - 1)) - /(X(s)))2] < - ^ 


2=1 


df 


dxi 

2 


dxi 


2 


(c) For all s we have C(5) = (n — s)/(X(s + 1)) — (n — 2s)f{'K{s)) — sf{'K{s — 1)). 

(d) For all s < t we have 

t 

Y, C{u) = {n- t)f{X{t + 1)) + {t + l)f{X{t)) -{n-s + l)/(X(s)) - sf{X{s - 1)). 


Proof. For (jg) note that if s < t then E[U(t)|X(s + l),X(s)] = 0, and so E[U(t)U(s)] = 0. For similar 
reasons we have E[D(t)D(s)] = 0. 

For ©, note first that the L 2 triangle inequality implies that E[U(s)^] < 4E[(/(X(s + 1)) — /(X(s)))^], 
using the bound 

E [E[/(X(s + l))-/(X(s))|X(sr]< E [E[(/(X(s + 1))-/(X(s)))2|X(s)]]=E[(/(X(s + 1))-/(X(s)))2]. 

X(s) X(s) 

Since / is multilinear, we have f(X(s + 1)) = f(X(s)) + —(X(s)), and so 

^^■7r(s + l) 


IE[U(s)2] < 4E 


df 


dx. 


-(X(s)) 


■E 


7r(s+l) 

E 

i: X(s)i=0 


dxi 


:x(s)) 


<—y E 

71 — Q f ^ 


2=1 


K 

dxi 


(X(s)) 
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This implies our estimate for E[U(s)^]. The estimate for E[D(s)^] is obtained in the same way. 
For (toj), we again use the formula /(X(s + 1)) = /(X(s)) + —(X(s)) to obtain 

^^7r(s + l) 


E[/(X(s + l))-/(X(s))|X(s)] = 


■E 


E 


i: X(s)i=0 


Similarly, using the formula /(X(s — 1)) = /(X(s)) — (X(s)) instead, we have 

df 


E[/(X(s-l))-/(X(s))|X(s)] = --E 


E 


1 -. X(s)i=l 


dx. 


■(X(s)) 


Combining these formulas together, we obtain 


C(s) 


(n 

(n 


s)/(X(s + l))-(n 
s)/(X(s + l))-(n 


2s)/(X(s))-s/(X(s-l))+E 



2s)/(X(s))-s/(X(s-l)), 


since / is harmonic. 

Part 0 follows by simple computation, whose highlight is noticing that the coefficient of /(X(u)) for 
s < u < t in the sum is {n — u + 1) — {n — 2u) — (u + 1) = 0. □ 

The exact definition of C(s) is aimed at the cancellation of the derivative terms in the proof of Lemma l4.2l[gj) . 
in which we use the harmonicity of /. We also use the harmonicity of / to bound the variance of / with 
respect to various slices. 

The next step is bounding the quantity appearing in Lemma |4.2t|b|) in terms of d, the degree of /. This 
is a simple application of Lemma 13.151 on page 1121 

Lemma 4 . 3 . For every integer 0 < qn < n such that d < q{l — q)n we have 



Proof. Combine Lemma 13.151 with Lemma 13.161 □ 

Our work so far has not focused on any specific slice. Now we turn to the following question. Suppose 
that f has unit variance on the slice What can we say about its behavior on a nearby slice 

Lemma 4.4. There exists a constant K > 0 such that the following holds. Let 0 < pn, qn < n be integers 
such that \p — q\ < p(l — p)/(2d), and suppose that = 1. If df < Kp{l — p)n then 

E[(/(X(pn)) - /(XM))2] = O • 

Proof. Let k = pn and i = qn, and assume that k < i (the other case is very similar). Note that \{d/dp)p{l — 
p)! = |1 — 2p| < 1 for p G [0,1], and so |p(l — p) — q{i. — q)\ < \p — q\ < p(l — p)/{2d). Therefore for 
K < 1I2 the assumption d^ < Kp{l — p)n implies that < r(l — r)n for all r G [p, g], since r(l — r) > 
p{l -p)- p(l - p)/(2d) > p(l - p)/2. 

Lemma 11210 implies that 


e 

A 4 ^ C(u) = {n- e)f{X{£ + 1)) + {£ + l)f{X{£)) -{n-k + l)/(X(fc)) - fc/(X(fc - 1)). 

u—k 
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As a first step, we bound E[A^]. The L 2 triangle inequality implies that 


E[A2] < 2E 


J2in-u)V{u) 


\u—k 


+2E 


^mD(u) 


\u—k 


A2 




Lemma IT^|a|) and Lemma IT^tlbl) show that 

£ £ n 

u—k u—k i—1 

Applying Lemma [4.31 fusing cP < r(l — r)n for all r G [p, g]), we obtain 


df 


dxi 




d{n — u) 


[u/n){l — u/n) 


¥[/].„ < ^ O 


dn 


p{l-p) 


V[/]. 


U—k \ ^ / u—k 

Corollary 13 .101 together with Lemma [3.111 (and our assumption on d) implies that 


A^ 




u—k 


dn 


pil-p) 


/ |t' — k\dn\ 
\p{l-p) ) 


We can bound A| similarly, and conclude that 


E[A2] = O 


/1€ — fc|dn\ 
\p{l-p) ) ' 


The triangle inequality implies that 


(n + 1)|1/(X(£)) - /(X(fc))|| < II All + (n - ^)||/(X(£ + 1)) - /(XW)|| + fc||/(X(A - 1)) - /(X(A))||. 

The latter two terms can be bounded in the same way that we bounded E[U(rt)^] above, and we conclude 
that 

{n + 1)||/(XW) - /(X(fc))|| = O . 

This implies that 

i:|(/(X(p,.)) - /(X(,„)))>i = o (= o (h^) . 

Combining Lemma 14.41 with Chernoff’s bound, we obtain our main theorem. 

Proposition 4.5. Let 0 < p,e < 1 satisfy e < p{l — p). Then 

Pr[|B(n,p) - np\ > en] < . 

Proof. Suppose first that p < 1/2, and let X ~ B(n,p) and p, = np. One common version of Chernoff’s 
bound states that for 0 < 5 < 1 we have 

Pr[|A -p\> Sp] < 

Choose 6 = e/p, and note that 6 < 1—p < 1. Since 6'^p = (e^/p^)(np) = e^n/p > e^n/{2p{l—p)), the bound 
follows in this case. 

When p > 1/2, we look at A ^ B(n, 1 — p) instead, using the fact that n — X ^ B(n,p). □ 
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Theorem 4.6. There exists a constant K > 0 such that the following holds, whenever p{l — p)n > 31ogn. 

Let f be a harmonie multilinear polynomial of degree d satisfying d < K\Jp{l — p)n such i/iai V[/]yp„ = 1. 
For any Lipschitz function p, 

I E [pif)] - E[<p(/)]| = O ( I ^ ] . 

’'pn Up \ W y'p[l — p)n J 

Proof. We can assume, without loss of generality, that E[/] = 0 ICorollarv l.S. 101 implies that the expectation 
is the same with respect to both Vpn and /ip), so that E[/^] = 1, and that (/j(0) = 0. Since p is Lipschitz, it 
follows that \p{x)\ < |a:| for all x. 

Let 5 := p(l — p)/{2d). Lemma [4.41 implies that whenever \q — p\ < 6 , 

I E Mf)] - E Mf)]\ = \EMf{X{pn))) - p{f{Xiqn)m < |E[/(X(pn)) - /(X(gn))]| 

t^pn L'qn 

< WfiXipn)) - /(XM)II = O . 


E[pif)]-EMf)]\ = 

^pn f-^p 


Expressing pp as a mixture of distributions of the form Vqn, we obtain 

EN/)]-E -P)”-'E[^(/)] 

-- ^ VJ 

^ E - pT~'\ E [^(/)] - 

VJ 

,iv^IWii + 


fco 

< O 


p(l -p) e~B(n,p) 


C 


Pr[|B(n,p) -pn\ > 5n] • | E [</?(/)]! + E [\p{f{x))\ ■ l\^,a:i-pn\>5n] ■ 

ypn Xr^flp 

V ^ V ^ 

^ V V 

ei €2 

The coefficient C can be bounded using Proposition id. bl and the formula E[X] = Pr[X > t] dt (for X > 0): 


pOO _ pOO 

C = / Pr[\/|B(n,p)/n — p\ >t\dt= / Pr[|B(n,p) — pn| > t^n] dt 

JO JO 

/■\/p(i-p) 


< 


2 g-t n/{ep(i-p)) / _2e-P(i-P)"/6 dt. 


Substitute s = y/n/6p(l — p)t to get 


C <2 




4 p{l-p) 


since e-P(i-P)"/6 < ^ whereas (/eM = > i _ 

We proceed to bound the error terms ei,e 2 . With respect to Vpn, E[|(/3(/)|] < E[|/|] < ||/|| < 1, and so 
Proposition 14.51 implies that 

ei < 

For the second error term, let M > 0 be a parameter to be determined. We have 

£2 < MPr[|B(n,p) - pn\ > dn] + [\p{f{x))\ ■ 1|^(/(x))|>m]- 
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If \(p{f{x))\ > M then certainly |/(a:)| > \(p{f{x))\ > M, and so 

^EJ\ipif{x))\■ 1|^(/(,))|>m] < ■ 1|/(x)I>m] < ^ Ej/2] = O , 

since Lemma 15.111 and Corollary 13.101 imply that ||/||^p = 0(||/|Up„) = 0(1) for small enough K. Hence 

£2 < + O . 

Choosing M = e--5"’^/(i2p(i-p))^ ^e conclude that 

Putting eyerything together, we obtain 


I EMf)]-EMf)]\ = o[J 

Pp y\j ^yp[l—p)n 

Substituting 5 = p{l — p)/{2d), we deduce 


_|_ g 12p(l-p) 


EMf)]-E[p{f)]\=0 

^pn f-^p 


a/p( 1 -p)n 


p(l-p)n 

+ e 48d^ 


V 


/ 


Note that B = e When A < 0.38, calculation shows that B < A. Since A < y/K, choosing 

K < 0.38^ ensures that B < A. □ 


As a corollary, we can estimate the Leyy distance between /(t'pn) and f{pp), along the lines of P71 
Theorem 3.19(28)]. 

Corollary 4.7. Suppose that p{l — p)n > 31ogn, and let f be a harmonic multilinear polynomial of degree 
d satisfying d < K^p{l — p)n such that V[/],ypp = 1, where K > 0 is the constant from Theorem \4-(>\ The 
Levy distance between f{vpn) and f{fJ,p) is at most 

d 

\/p(l -p)n 

That is, for all y it holds that 

Pr[/ < y - e] - e < Pr[/ < y] < Pr[/ < y + e] + e. 

^pn fAp ^pn 

Proof. Giyen y, define a (l/£)-Lipschitz function p by 




[1 if t < y, 

T{t) = < if y < i < y + e, 

I 0 if t > y + £. 


It is easy to check that lt<y < <p(0 ^ lt<y+e, and so 


Pr[/ < y] - Pr [/ < y + e] < E [(/?(/)] - E [ip{f)] = -O 

f-Lp t-^pn fdp t-^pn €. 

using Theorem 14.61 and the correct choice of £. We get the bound in the other direction in the same way. □ 


i/y(l -y)n 
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We conjecture that f{vpn) and /(/Xp) are also close in CDF distance. Unfortunately, the method of proof 
of [27l Theorem 3.19(30)] relies on the anticoncentration of multivariate Gaussian distributions [ 8 ], whereas 
both f(vpn) and /(np) are discrete distributions. We consider it an interesting open problem to extend 
Corollary 14.71 to CDF distance. 

Question 4.8. Suppose that p{l — p)n is “large” and d is “small”, compared to n. Is it true that for every 
harmonic multilinear polynomial f of degree d satisfying V[/]i/ „ = 1, the CDF distance between f{i'pn) and 
/(^p) is o{l)? 

4.2 High-degree functions 

Theorem 14.61 requires that d = 0(-\/p(l — p)n). Indeed, Lemma 13.111 which implies that the norm of a 
low-degree function is approximately the same under both fj,p and Vpn, already requires the degree to be 
0{^/p(l — p)n). For d = (jj{\Jp{l — p)n) and constant p ^ 1/2 we exhibit below a 0/ ± 1-valued function / 
which satisfies ||/|lpp = 1 while ||/||i/p„ = o(l). This shows that for constant p 7 ^ 1/2 the dependence on the 
degree is essential in Theorem 14.61 since | [|/|] — lEpp[|/|]| = 11/11^^, — 11/11^^^ = 1 — o(l). We do not know 

whether this dependence is necessary for p = 1/2. Indeed, Lemma 13 .11 1 can be extended above y/n in this 
case, as the calculation below shows. 

Let d = u}{^/p{l — p)n), and assume further that d = o((p(l — We consider the function 

/ = ( 2 p(l-p))-^/ 2 (cri — X 2 ) ■ ■ ■ {x 2 d-i — X 2 d)^ whose Pp-norm is 1 according to Lemma [3.111 The lemma 
also gives its z/^-norm (where k = pn) as 


= (P(l -P)) 


k-{n — k)- 


-,2d 


We estimate this expression using Stirling’s approximation, starting with k-: 


k^ = 


kl 


{k-dy. \k-d 


^ \ k-d+ 1/2 


^g0(l/fc)-0(l/(/c-d)) = 1 + 


k — d 


k—d 


Ud 

^(l± «(!))• 


The Taylor series log(l -t- a:) = a: — x^/2 + 0{x^) shows that 


/ d 



d^ 

v+k-d) 

r 2(fc - d)+ 

= exp 

/-2r + °w] 


and so k- = k’^e /^^(l ± o(l)). We can similarly estimate (n — k)- = (n — k)‘^e ^^(1 ± o(l)) and 

n— = ± o(l)), concluding that 

II/IIL = (p(l ±0(1)) 


= exp 


(-1 -b4p(l -p)) ± 0 ( 1 ) 


2 p(l — p)n 

If p 7 ^ 1/2 is fixed, we immediately conclude that \\f\\vk = o(l). 


5 Approximately Boolean functions 

Theorem [421 only applies to Lipschitz test functions, but in many applications we are interested in functions 
which grow faster, for example the squared-distance-from-{—1,1} function (p{x) = (|a:| — 1)^. In this section 
we show how to handle such functions using hypercontractivity. 
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Proposition 5.1. Denote by || • ||r the norm. For a multilinear polynomial f, let Tp denote the operator 


Tpf = Y,p^r\ 

2=0 


For r > 2 and with respect to pip, 


\\Tpf\\r < II/II 2 , where p = 

For a harmonic multilinear polynomial f, let Ftp denote the operator 


n/2 

i=0 


For r >2 and with respect to Vpn, 

\\Hpf\\r< II/II 2 , where p — _ 2yog(p(i-p)) ~ (p(l — ' 

Proof. The first result is classical, appearing in ISE] for p = 1/2 and in [38l [3T] for general p. The second 
result is due to Lee and Yau [M]. They proved the corresponding log-Sobolev inequality, which implies 
hypercontractivity as shown in [12]. □ 

These results imply that for low-degree functions, the Lr norm and the L 2 norm are comparable. 

Lemma 5.2. Let f be a multilinear polynomial of degree d. With respect to Pp, 

||/||3<0(P(1-P))-°^''^II/I|2. 


If f is also harmonic, then with respect to Vpn, 

\\fh<0{p{l-p))-°<^'^'^\\fh. 

Proof. In both cases, we apply Proposition 15.II to Ti/p/ or to Hi/pf, where p is given by the proposition. In 
the first case, we get 

d 

11 / 1 I 3 < \\r/pf\\i = 

2=0 

In the second case, we get 


WfWl < wuifpfwi = 

i=0 


We obtain the stated bounds by substituting the values of p. □ 

We can now obtain our invariance principle for (p{x) = (|x| — 1)^. 

Theorem 5.3. Suppose that p{l — p)n > 31ogn, and let f be a harmonic multilinear polynomial of degree 
d satisfying d < K^p(\ — p)n such that ||/||i/p„ = 1, where K is the constant in Theorem [23 We have 

I E [(I/I - 1)2] - E[(|/| - 1)2]| = O [(/ , ^ . 

Up \ y y/p[l—p)n J 
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Proof. Note that ||/||!yp„ = 1 implies that < 1 and ||/||^p = 0(1), due to Corollary 13.101 and 

Lemma 13.111 

Let M > 1 be a parameter to be decided, and define the 2(M — 1)-Lipschitz function ip by 


^(.T) = I ^ 

^ ((M- 1)2 if |a;| > M. 

When |x| > M, we have (|a;| — 1)^ < and so with respect to any measure we have 


E[{\f\-lf]<E[iP{f)]+E[fl 
With respect to either jXp or using Lemma 15.21 we can bound 

= ^ii/iii < Lo(p(i -p))-oo). 

Theorem 14.61 therefore implies that 


I E [(I/I - 1)2] - E[(|/| - 1)2]| < MO (J / I + Lo(p(l -p))-0('^). 

Mp yy yjp{l-p)nj M 

Choosing M to be the geometric mean of both terms appearing above results in the statement of the 
theorem. □ 


As an illustration of this theorem, we give an alternative proof of m Theorem 7.5], a Kindler-Safra 
theorem for the slice. 


Definition 5.4. A function / on a given domain is Boolean if on the domain it satisfies / S {±1}- If the 
domain is a cube, we use the term cube-Boolean. If it is a slice, we use the term slice-Boolean. 

Proposition 5.5. Let f be a multilinear polynomial of degree d such that E^p[(|/| — 1)2] = e. There exists 
a cube-Boolean function g on (p(l — p))~^^‘^'^ coordinates such that ||/ — = 0{{p[l — p))~^^'^^e). 

Proof. This is essentially proved in mm]. Explicitly, they prove the same result without the guarantee that 
g is cube-Boolean. In order to get our version, let F = sgn/ and G = sgng. By definition ||F— /||2 = e, and 
so ||F—g||2 = C)((p(l—p))“‘^(‘^)e). Since F is cube-Boolean, this implies that ||F —G||2 = 0((p(l—p))“‘^('^)e). 
We conclude that ||/- G||2 = 0{{p{l - p))-^^'^h). □ 

Theorem 5.6. Let f be a slice-Boolean harmonic multilinear polynomial such that ||/^'^||yp^ = e, where 
There exists a slice-Boolean harmonic multilinear polynomial h depending on (p(l — 
pyf-0{d) coordinates (that is, invariant to permutations of the other coordinates) satisfying 

11/ - h||2^^ < 0((p(l - p))-°('')e) + 0,,d . 

Before we can prove this theorem, we need an auxiliary result m Theorem 3.3], which we prove here 
(simplifying the original proof) to make the proof self-contained. The proof uses a result from Subsection llO.il 

Theorem 5.7. Let f be a multilinear polynomial depending on M < min(p, 1 — p)n variables, and let f be 
the unique harmonic multilinear polynomial of degree at most min(p, 1 — p)n agreeing with f on the slice (|()|) 
(the harmonic projection of f on the slice (|()|)/ Then 
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Proof. The Fourier expansion of / with respect to fip is 


/ = ^ .f{S)ujs, where ws 
SCM 


n 

zGS 


Xj - p 

\/p(l -p) 


The characters los are orthogonal and have unit norm with respect to Pp. This shows that H/H^^ = 

J2scm ■ Harmonic projection is a linear operator, and so denoting the harmonic projection of wg 
by ws, we have 


/-/= E f{S){u:s-Cjs)- 

SCM 


We show below that 

Iks - ^sWl^ = o 

The L 2 triangle inequality then implies that 


p{l-p)n) ■ 


( 1 ) 


II/-/II", E /k)'lks 

SCM 


UJS 


\,.<o 


M22M 

p{l-p)n 


E /k)' = o 

SCM 


( M‘^2 


2nM 


\p{l-p)n 


ll/ll 


2 


It remains to prove ©■ For definiteness, consider S = {1,..., d}, where d < min(p, 1 — p)n. The first 
step is to consider the related function xs = nf=i harmonic projection xs- We will be particularly 

interested in the coefficient of the monomial Y[i=i Xs- According to Corollary 110.41 the coefficient of 
Y[i=i Xs is independent of k, as long a.s d < k < n—d. This suggests considering the harmonic projection 
of nf=i til® sii®® On that slice, nf=i i® the indicator function of the point p = {1,..., d}, and 

we denote its harmonic projection on by Ip. The coefficient of 0^=1 in Ip clearly equals its coefficent 
in Ip^^. Since all monomials in Ip'^ have degree d, this coefficient is simply lp‘^(p). 

Let Ed be the operator projecting a function f) on to The trace of Ed is clearly the dimension 
of the space of harmonic multilinear polynomials which are homogeneous of degree d, which is 
by Corollarv l3.8l Symmetry shows that the diagonal elements of Ed are all equal to 

O - L-,) I d 

O n-d+l 

This is exactly the value of Ip'^(p). We conclude that the coefficient of Oti in Xs is 1 — ^-‘d+i > nnd so 


Xs-Xs = 


XS H-, 


n — d + 1 

where the dots hide a linear combination of other monomials. 

If we substitute Xi := in ys then we get a harmonic multilinear polynomial of degree d which 

Vp(i-p) 

agrees with ws on the slice and so equals Cjs- Substituting this in the preceding formula, we deduce 

that 

d 


OJS — us = 


n — d + 1 


us + 


where tp is orthogonal to us with respect to Pp. 

Since us agrees with us on the slice, we can compute explicitly. Simple estimates show that 

lks||^p„ < 1 + 0( p(il^p)n ) (see [T71 Lemma 4.2] for the details). Corollary 13.101 and Lemma 13 .11 1 imply the 
same bound on |ks||p^. It follows that 


Iks - = 


n — d + 1 


I flp 


;rkTT + i«.-('-;rkTT 


< 


2d 


n — d + 1 


+ 0 


p{l-p)n 


= O 


p{l-p)n 
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This completes the proof. 


□ 


Armed with this result, we can prove Theorem 15.61 

Proof. Let / = f-^ (that is, / = Y.t<d so that [(|/| - 1 )^] < [(/ - f)'^] = e. Notice that / is 

a harmonic multilinear polynomial of degree at most d. Theorem 15.31 implies that 


E[{\f\-lf]<e + 0 




Proposition 15.51 implies that there exists a cube-Boolean function g on a set J of M = 0((p(l — 
coordinates such that €2 = E/ip[(/ — gY] = 0((p(l — The function g is also slice-Boolean, but it 

is not necessarily harmonic. Let g be its harmonic projection on (f'd); this will be our choice for h. Note that 
g also depends only on the coordinates in J, and in particular it has degree at most M (in fact. Lemma 13.171 
implies that degg < deg 5 ). Invoking Theorem 15.71 we see that ||(7 — g||^p = and so 


e3 = \\f-9\\l=0 


p{l — p)n 



Corollary 13. 101 and Lemma 13 .11 1 imply that ||/ —= 0 (e 3 ), using the fact that deg(/ —g) < M. The L 2 
triangle inequality shows that 


II/- 5 IIL < 0 {\\f-f\\l+es) = 0((p(l-p))-«(")e)+0 


^yp{l-p)n 


(p{l-p))-Oi‘i)]+o 


M^2 


2oM 


p{l-p)r 


□ 


The proof of |171 Theorem 7.5] contains an additional argument guaranteeing that degh < d. The same 
argument can be applied here. The idea is that there are finitely many Boolean functions on (p(l — p))~^^'^'> 
coordinates, and each of them of degree larger than d has (as n —>■ 00 ) constant distance from all Boolean 
functions of degree at most d. Hence if e is small enough, g must have degree at most d. We refer the reader 
to [T7] for the complete details. 


6 Multilinear functions 

Theorem 14.61 only applies to harmonic multilinear polynomials. The harmonicity condition is crucial here. 
Indeed, a polynomial such as X]r=i behaves very differently on the Boolean cube (where it has a non-trivial 
distribution) and on a slice (where it is constant). Nevertheless, we are able to recover similar theorems by 
looking at several slices at once. Our first invariance result, which we call norm invariance, states that if two 
low-degree multilinear polynomials /, g are close in L2 norm for enough “well-separated” slices, then they 
are close in L 2 norm over the entire Boolean cube. Our second invariance result, which we call interpolation 
invariance, gives a recipe for constructing the distribution of a low-degree multilinear polynomial / from its 
profile on several “well-separated” slices, where by profile we mean the distribution of / on several coupled 
slices, just as in Section S) 

Our main technical tool is a theorem of Blekherman [^, which states that any degree d multilinear 
polynomial P corresponds uniquely to a degree d polynomial Q in the variables xi,... ,Xn, S such that 

(a) P(xi,..., Xn) = Q{xi ,..., Xn, Xi + ■ ■ ■ + Xn) for any point in the Boolean cube {0,1}". 

(b) For each e, the coefficient of S'® is a harmonic multilinear polynomial (of degree at most d — e). 
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This theorem allows us to reduce the analysis of arbitrary multilinear polynomials to that of harmonic ones. 

We state Blekherman’s theorem in Section [6.II After preparatory work in Section [6^ we prove our norm 
invariance theorems in Section 16.31 and our interpolation invariance theorems in Section 16.41 

There are several principal results in this section, which we now highlight. 

Norm invariance f i36.3l) The main results are Theorem 16.81 which bounds the L 2 norm of a low-degree 
multilinear polynomial on the Boolean cube with respect to its L 2 norm on several well-separated slices, and 
Theorem l6.9l which goes in the other direction, bounding the L 2 norm of a low-degree multilinear polynomial 
on a slice in terms of its L 2 norm on the Boolean cube and the centrality of the slice. 

Both results are combined in Corollary 16.101 which states (roughly) that a low-degree multilinear poly¬ 
nomial has small L 2 norm on the Boolean cube if and only if it has small L 2 norm on several well-separated 
slices. Another conclusion. Corollary 16.111 states that two low-degree multilinear polynomials are close in 
L 2 norm on the Boolean cube if and only if they are close in L 2 norm on several well-separated slices. 

Interpolation invariance( !16.4p The main results are Theorem l6.13l and Corollary 16. 141 which show how 
to estimate the distribution of a low-degree multilinear polynomial on a given slice given its distribution 
on several coupled well-separated slices, and Theorem 16.171 and Corollary 16.181 which similarly show how 
to estimate the distribution of a low-degree multilinear polynomial on the entire Boolean cube given its 
distribution on several coupled well-separated slices. 

These results imply that if two low-degree multilinear polynomials have a similar distribution on several 
coupled well-separated slices then they have a similar distribution on other slices and on the entire Boolean 
cube, as we show in Corollarv l6.16l and in Corollary 16. 191 respectively. 

6.1 Blekherman’s theorem 

Our starting point is a theorem of Blekherman [5] quoted in Lee et al. [23]. For completeness, we prove this 
theorem in Section riO.il 

Theorem 6.1. Let f be a multilinear polynomial overxi,... ,Xn of degree d < n/2, and define S := 

There exist harmonic multilinear polynomials fo,..., fd over Xi,..., Xn, where deg fi <d — i, such that 

d 

f{xi, ... ,Xn) = (mod /), where I = (x? - Xi, ..., - Xn), 

or equivalently, both sides agree on every point of the Boolean cube {0,1}”. Moreover, this representation is 
unique. 

For our purposes, it will be better to consider / as a polynomial in {S — np) jy/np{l — p) rather than in 
S. 

Corollary 6.2. Let f be a multilinear polynomial over xi,... ,Xn of degree d < nj^l, letp S (0,1), and define 
S := (X]r=i ~ ~ P)- There exist harmonic multilinear polynomials fo,..., fd over xi,..., Xn, 

where deg fi < d — i, such that 

d 

/(xi,... ,Xn) ^ . -. ,Xn)5* (mod /), where / = (x^ - xi,..., x^ - Xn), 

or equivalently, both sides agree on every point of the Boolean cube {0,1}". Moreover, this representation is 
unique. 

Proof. Follows from the fact that S and S are affine shifts of one another. □ 
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We call the representation of / in Corollarv l6.2l its Blekherman expansion with respect to p, and /o,..., 
its Blekherman coefficients with respect to p. If we substitute S' = tr in the Blekherman expansion then 
we get a harmonic multilinear polynomial of degree at most d which agrees with / on the slice where 
k = np + a ^/np(r^^j)). We denote this function by /[cr]. Note that this notation depends on p. 

The Blekherman expansion is linear in the sense that if h = af + fdg then he = afe + Ppe and h[a] = 
cxf[a] + Pf[cr]. This immediately follows from its uniqueness and the fact that harmonic functions are closed 
under taking linear combinations. 


6.2 Vandermonde interpolation 

Our arguments will involve extracting the Blekherman coefficients fi given /[cr] for various values of cr. We 
will consider the simple setting in which we are given d +1 values of cr, and in that case the problem translates 
to solving a system of linear equations whose coefficient matrix is a Vandermonde matrix. The quality of the 
reconstruction will depend on the magnitude of the entries in the inverse matrix, which we estimate using a 
result of Turner [3^ . 

Proposition 6.3 (Turner [39]). Tet/i,... be arbitrary real numbers, and consider the nxn Vandermonde 
matrix V given by Vij = where 1 < i,j < n. The inverse ofV is given by V~^ = UL, where 


Lij — 


n 


- Cfe 


if i < j, 

otherwise, 


f 0 ifi>j, 

Utj = 1 ifi=j, 

ifi < j, where C/o(j-i) = 0. 


This proposition implies the following interpolation result. 

Theorem 6.4. Suppose that f is a multilinear polynomial over xi,... ,Xn of degree d < nl2 with Blekherman 
coefficients fo,..., fd with respect to some p € (0,1). Let ai ,..., cr^+i be d + 1 distinct values, and define 


rj = min(l, min [cTj — cr^- j), M = max(l, max jcrjj). 

i 


For 0 < e < d and 1 < i < d + 1 there exist coefficients Cei of magnitude \cei\ < (2M/riY such that for all 
0 < e < d, 

d+l 

fe = X!'=ez/[crz]- 
i=l 

Proof. Let V be the Vandermonde matrix for ai,..., (Xd+i, so that 


’ fwl] ' 


fo 


= V 


J[<^d+l]_ 


fd_ 


Inverting V, this shows that 


’ /[o-i] ’ 


fo 

fWd+i]_ 


Jd_ 


We can thus take Cei = It remains to bound the magnitude of the entries of V 

Let L, U be the matrices in Proposition 16.31 The formula for L implies that all of its entries have 
magnitude at most (l/yy)'^. As for U, we will prove by induction that when i < j, \Uij\ < . This 

is clearly true when i = j. When i = 1, \Uij \ = 01=1 lo'fcl ^ Mi~^. The inductive step follows from 





Mi-* + M ■ 




Mi-\ 


It follows that all entries of U are bounded in magnitude by ■ M'^. The theorem follows. 


□ 
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6.3 Norm invariance 


We are now ready to prove our norm invariance principle. Our argument will require a few auxiliary results. 
We start with an estimate on the central moments of binomial distributions. 

Lemma 6.5. Let S ^ B{n,p) and S = . For all d>0, 

v"p(i-p) 




2d\ 

{2p{l-p)Y' 


Proof. Hoeffding’s inequality states that 

Pr[|S' - np\ >t]< 2 e- 2 *"/« ^ Pr[| 5 | >t]< 2 e- 2 p(i-p)‘'. 

Plugging this in the general formula E[|X|] = Pril-^l > t] dt, we get 

cOO cOO 

^ / Pj.[^ 2 d / e- 2 p(i-p)P^‘' dt. 

Jo Jo 

Substituting s = 2p(l — p)t^^'^, we have t = (s/2p(l — p))‘^ and so dt/ds = d{s/2p{l — p))‘^~^/2p{l — p), 
implying 

poo 

< 2 . ( 2 p(l - p))-‘^ / ds. 

Jo 

Since (e“®F(s))' = e“'*(P'(s) — P{s)), it easily follows that f e~^Q(s) = —e~^(Q(s) + Q'(s) H-), which in 

our case gives 

E[^2d] < 2 . (2p(l - p))-"^ (-e-0 [ds‘^~^ + did - + • • • + d!] |“ = 2 • (2p(l - • d!. □ 

We comment that for every hxed p and d, as n —>■ oo, converges to E[N(0,1)^"^] = • 

We also need an anti-concentration result for binomial distributions, which follows from the Berry-Esseen 
theorem. 

Lemma 6 . 6 . Let S ~ B(n,p) and S = —r=r^=- For all a < b, 

y/npil-p) 

Pr[5 G ia,b)] > ^ 

VnpO-p) 

Proof. The Berry-Esseen theorem states that the cumulative distribution functions of S and of the standard 
normal distribution N(0,1) differ by at most Cp/a^^Jn, where C < 1 is an absolute constant, p = E[| Ber(p) — 
Pl^] = pO ~ P)iP^ + (1 ~ and cr^ = V[Ber(p)] = p(l — p). The result follows from p < p{l — p)/2. □ 

We are now ready to tackle norm invariance. We start by bounding the norm of / on the Boolean cube 
given its norm on several well-separated slices. 

Definition 6.7. Fix p G (0,1) and n. A set 0 < ki,... ,kr < n \a said to be an ( 77 , M)-system, for 77 < 1 
and M > 1 , if the following two conditions hold for ai = ■ 

^Jnp(l-p) 

(a) For every i ^ j, |cri — aj \ > 77 . 

(b) For every i, \ai\ < M. 
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Theorem 6.8. Letp £ (0,1), and let f be a multilinear polynomial overxi,... ,Xn of degree d < y/np{l — p). 
Let ki,... ,kd+i be an {p^M)-system, where M < ^Jnp{l — p)/2, and suppose that 11/11^^. < 1 for all 
\ < i < d 1. Then 


I ftp 


= O 


pil-p) 


0 (d) 

V ) 


Moreover, the Blekherman coefficients /o,..., /d of f with respect to p satisfy, for 0 < e < d, 


ll/elU, < OiM/pf. 


Proof. If c? = 0 then / is constant, so we can assume that d > 1. 

We are given that for each i, ||/[cri] < 1. Since fieri] is a harmonic multilinear polynomial of degree 

at most d, Corollary 13.101 and Lemma [3.111 show that for q = ki/n, 


ll/WII? < - !>))“ 


I ftp - 


(2g(l - q)y 


1-0 


q{l - q)n 


-1 


0 ( 1 ). 


( Pit-p) \ 

U(i-< 7 )y 


Since M < \Jnp{l — p)/2, we have 


M^np{l-p) p{l-p) 

\p-q\< ---< —^. 


which implies that 


< 


< 2 . 


q p-p{l-p)/2 1+p 

Similarly, (1 — p)/{1 — q) < 2. This shows that 

Theorem 16.41 shows that there exist coefficients Cei of magnitude at most {2M/pY such that for each e, 
fe = Cei fieri]. It follows that for each e, 

d+l 

ii/eiu, < E ice.iii/Niu. < (d+1) • o{M/pr ■ o{ir = o{m/pY. 


Since / = the Cauchy-Schwartz inequality implies that 


\fip 


d d _ 


e=0 


e=0 


Lemma [6l5] shows that 


= fns^‘^] = o 


ftp 


d 

p{i-p)) 


Hypercontractivity, in the form of Lemma 15.21 implies that with respect to pp, 


^/m\ = ll/eiu < 0(p(l -p))-°(^)||/e ||2 = o 
In total, we deduce that 


M 


p{l-p)p 


o{d) 


I f^p 


<{d + l)-0 


■O 


pi^-p)j \pit-p)v 


M 


0{d) 


= o 




p{l-p) p J 


□ 
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We can also go in the other direction. In the statement of Theorem 16.91 and similar results below, we 
allow big O constants to depend on the fixed value of p (and below, on the fixed value of other parameters). 

Theorem 6.9. FixpG (0,1), and let f be a multilinear polynomial over Xi,... ,Xn ofdegreed< \/iogfnp(W^py730j 
satisfying ||/||^j, <1. If k = np + ^np{l - p) ■ a then 


li'fe — ® 


0{<F)\^\d d\t7\/y/np(l-p) 


Proof. If d = 0 then / is constant, so we can assume that d > 1. 

For 0 < J < d, let Oi = z — 1/2 and bt = i. Lemma [6^ shows that 


Pr[5 e {a^,bi)] > 


1 


2 \/^ 


= _ 


1 


^/npil-p) 


The first summand is at least 


1 


2^ 


„-log[np(l-p)/30]/2 


> 


1.09 


2\/^ ^np{l -p)/30 ^np(l - p)’ 


and so 


Pr[,S G {ai,b^)] = ^ /^). 

In particular, the norm of / restricted to 5 G {ai,bi) is 0{e‘^ /^), and so there must exist ki = np + 
^/np{l -p) ■ (7i such that cr^ G {ai.bi) and \\f\\vk, = 0{e^ /^). 

The resulting system ko,...,kd is a (1/2,d)-system, and so Theorem 16.81 shows that the Blekherman 
coefficients fn,..., fd satisfy 

||/elU,<0(e‘^'/^)-0(d)-^ = e°(-^^). 

Substituting a given value of u, we deduce that 




e=0 


Corollary 13.101 and Lemma [3.111 show that for q = k/n, 

df 


' (2p(i-p)y/T‘ + o 


1/2 


^p(l -p)n 

In order to estimate the final factor, note that q = p + \j o’ and so 


= ) • 


p(i-p)) 


^ = 1 + 


1 — p a 

p y/n 


< eV < g<T/\/"p(i-p)_ 


Similarly (1 - g)/(l - p) < e-'^/V"P(i-p). Therefore 


( q{l-q) y'\ d\a\u/F^(TF^) 

Ui-p); - 

and the theorem follows. □ 

We can combine both results to obtain the following clean corollary. 

Corollary 6.10. Fix p G (0, 1) and d > 1, and suppose that IF is a collection of multilinear polynomials of 
degree at most d on at least variables (different functions could depend on a different number of 

variables). The following three conditions are equivalent: 
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(a) There exists a constant Ci such that ||/||/jp < Ci for all f ^ T. 

(b) There exists a constant C 2 such that \\f\\^i, < {C 2 Y all f G iF over xi,...,x„ and for all 
0 < k < n. 


(c) There exists a constant C 3 such that WfW^k < C 3 for all f G F over xi,...,Xn and for k = [np+ 
^ynp{^-p) • crj for a G {0,, d}. 


Proof. Suppose first that condition (|aj) holds, and let a 
Theorem 16.91 shows that 


k—np 

\/np(l-p) 


, SO that |cr| < ^njpfX — p). 


Then 


ll/IU, < Cl • 


which implies condition (lb)) . 

Suppose next that condition © holds. In particular, for ui,..., 0-^+1 = 0,..., d and ki = np + 
^ynp{l — p) ■ Gi it is the case that |l/|Ufc. < {C 2 d^Jp{l — p)Y, which implies condition (jcj) with = 
{C 2 d^p{i-p)Y. 

Finally, suppose that condition (jc|) holds. Since 0,1,... ,d is a (1,(i)-system. Theorem 16.81 shows that 


iMp 


= o 


p{l-p) 


o{d) 


C3 


which depends only on p and d, implying condition f[aj). 


□ 


Here is a different interpretation of these results. 

Corollary 6.11. Fix p G (0,1) and d > 1. There is a constant C > 0 such that the following implications 
hold for any two multilinear polynomials of degree at most d on n> variables: 

(a) IfWf -gWup = e then for all 0 <k<n, ||/ - 5 IU, < 

(b) Ifki,..., kd+i is an {p,M)-system and \\f - g\\i,^, <eforl<i<d+l then \\f - g||^p < {CM/p)^^‘‘'>e. 

Proof. The first statement follows directly from Theorem 16.91 (using |tT| < Yn/p{l — p)), and the second 
statement follows directly from Theorem 16.81 In both cases, we apply the theorems to f — g. □ 


6.4 Interpolation invariance 

We move on to the interpolation invariance principle. Definition 14.11 describes a coupling X(0),... ,X(n) of 
the distributions z/Qi • ■ • i which we will use to define the profile of a function. 

Definition 6.12. Fix parameters p G (0,1), d,n> 0, and let fci,..., be an ( 77 , M)-system. The profile 
of a multilinear function / of degree at most d with respect to the system fci,..., k^+i is the joint distribution 
of /(X(fci)),... ,/(X(fcd+i)), which we denote by fi,... ,fd+i. 

The profile of a function / allows us to recover its distribution on arbitrary slices, as reflected by Lipschitz 
functions. 

Theorem 6.13. There exists a constant K > 0 such that the following holds. Let p G (0,1), let f be a 
multilinear polynomial on xi,..., of degree 1 < d < YKnp(l — p), let ki,..., k^+i be an {rj, M)-system 
for M < \Jnp{\ — p)/2, and let fi,...,fd+i be the profile of f with respect to this system. Suppose that 

ll/lUfe. < 1 for 1 < f < d+1. For every slice k = np+ ~ p)'0 ' such that L := max^ |cr —cr^ | < ^ -, 

and for any Lipschitz function (p, 

/ I T 

|E[<p(/)]-E[(^(f[/c])]| =0 (l + |cr|)—) W , =, w/iere f[fc] = 

V d / V v^np(l-p) ^ 
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for some constants ^ki depending on p, n, d satisfying 


l7fe^l 


, M 

O (l + |a|)- 

' T] 


Proof. Let pi = ki/n = p + \ (where tJi = '^p ) Jet q = k/n = p P The 

V ^Jnp{l—p) V 

condition on L guarantees that \pi — q\ = — o'! < The condition on M guarantees 

that \pi — p\ < < p{l — p)/2, and so \p{l — p) < Pi(l — Pi) < |p(l — p)- In particular, 

\Pi — q\ < • Lemma(applied with p := p^ and q ■= q) shows that 


E[|/(X(fc.)) - /(X(fc))|]2 < E[(/(X(fc.)) - /(X(fc)))2] = O 


dL 


y/np{l-p) 


Let fo,..., fd be the Blekherman coefficients of /, and let Cei be the coefficients given by Theorem 16.41 
so that \cei \ < (f2M/rf)^. Theorem 16.41 shows that for 0 < e < d, 


d+l 


d+1 


E[|/,(X(fc))-^CeJ(X(fc,))|] < ^|ce.|E[|/(X(fc,))-/(X(fc))|] < (d+l). 


2=1 


2=1 


2M 


J' 


O 


dL 


y'npil-p) I 


Since f[a] = X]e=o conclude that 


E[|/(X(fc)) - E E Ce./(X(fc.))l] <id+ 1)2 • (1 + Wf) ■(—) -olJ 

e=0 i=l V d / Y y 


dL 


y^np{l-p) 


Rearrangement yields the statement of the theorem, with 'jki = X)e=o n 

We immediately obtain a corollary for the Levy distance. 

Corollary 6.14. Under the setting of Theorem \6.1S\. the Levy distance between f{i'k) cind f[A:] is at most 


O 


1 + cr )— ) 
T] J 


L 


^npil-p) 


Proof. The proof is identical to the proof of Corollary 14.71 


□ 


A striking form of this corollary compares two different profiles. 

Definition 6.15. Let X = {Xi,..., Xr),Y = (Yi,...,W) be two r-dimensional real distributions. The 
Levy-Prokhorov distance between these distributions is the infimum value of e such that for all Borel sets A, 

1. Pr[X € A] < Pr[y e A*^] + e, and 

2. Pr[y e A] < Pr[X e A'] + e, 

where A*^ consists of all points at distance at most e from A in the L°° metric. 

In fact, for all our results it suffices to consider halfspaces for A, that is, sets of the form {{xi,... ,Xr) : 
X)i=i ^i^i Y b}. If we restrict the definition of Levy-Prokhorov distance in this way, then it coincides with 
the usual Levy distance in the one-dimensional case. 
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Corollary 6.16. Under the setting of Theorem 1 6 ‘. 13\ for two functions f,g, if the Levy-Prokhorov distance 
between the profiles of f and g is e, then the Levy distance between f{i'k) and g{i'k) is at most 


d /2 

o((i + H)-j 


y'npil-p) 


+ o|(i + H)^ 


e. 


Proof. We start by bounding the Levy distance between f[A:] and g[A:]. Given t, define the halfspace At by 


d+i 

At = {(xi,.. .,Xd+i) : < t}, 

i=l 


and notice that C At+Be, where 


B = 'p\lk^\=o({l + \a\)—] . 

i=l ^ ^ ^ 


Since the Levy-Prokhorov distance between the profiles of / and g is at most e, it follows that 


PT[t[k] <t]= Pr[(fi,..., fd+i) e At] < 

Pr[(gi,... ,gd+i) G ^t] + e < Pr[(gi,.. .,gd+i) G At+Be]+e = Pr[g[A:] <t + Be] + e. 

This shows that the Levy distance between f[fc] and g[fc] is at most Be. 

The result now follows from Corollary [6T4] and the triangle inequality for the Levy distance. 


□ 


By combining different slices, we can obtain a similar result for /ip. 


Theorem 6.17. There exists a constant K > 0 such that the following holds. Let p G (0,1), let f be a 
multilinear polynomial on Xi,..., x„ of degree 1 < d < K^np{l — p)! log[np(l — p)], let ki,..., kd+i be an 
{rj, M)-system for M < ^np{\ — p)/{Qd), and let ii,..., f^+i be the profile of f with respect to this system. 
Suppose that H/Hiy^. < 1 for 1 < / < d + 1. 


For s := •\/31og[np(l — p)], define a distribution f as follows: choose a ^ 
\a\ < s, and let f = f[o'], as in Theorem \6. 13[ Then for any Lipschitz function gy, 

0(d) 


B(rt,p) np conditioned on 
V”P(i“P) 


\E[g,if)]-E[g,ii)]\=0 


M\ 


1 


pil-p) gj f/np{l - p)' 


Proof. We can assume, without loss of generality, that (/^(O) = 0. 

Since M + s < due to the bound on M and our choice of s. Theorem 16. 131 implies that 


E 


k—np-\--\ynp{l—p)-a: 
|cr|<S 


Pr[B(n,p) = fc]| £[(/?(/)] -E[(/)(f[cr])]| = 0' ^ 

V ?7 


^np{l-p) 


E[{lp\a\Y./WTW\]. 


where a ^ B(ri,p) np ^ \^u\ < 1, C < and when |(t| > 1, C < {y/M + -y|^) < 

V"P(i“P) 

2 d+i|^| 2 dyC^^ Therefore Lemma 16751 implies that 


£[(7] < 2^^+!VM(1 + E[|crp‘^]) = O 


pil-p) 


d 

\fM. 
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We conclude that 


E 

k—np-\-y^np{l—p)-(T: 
|cr|<S 


Pr[B(n,p) = k] E[ip{f)] - (1 


ei)E[<p(f)] 


\p{l-p) 


mV 1 

V ) ^np{l-p)' 


where ei := Pr[|B(n,p) — np\ > \Jnp{l — p) ■ s]. Dividing this bound by 1 — ei and using 1 — = 0(ei), 

we deduce that 


\E[ip{f)]-E[ipii)]\<0{e^)EMf)] 

Mp 


[< P (/)1 


I -’T'PI >S\/ np{l-p) 




pil-p) 


mV 1 

V ) ^np{l-p) 


As in the proof of Theorem 14.61 Proposition 03] implies that ei < 2e Theorem 16.81 shows that 


, =0{^ _ 

\p{l-p) vj 


0(d) 


This allows us to bound the first error term above, since £[</?(/)] < E[|/|] < ||/|1. The other error term is 
bounded in the proof of Theorem 14.61 bv 0(e“® Altogether, we obtain 


E[<p(/)]-E[<^(f)]|<0 


p{l-p) rj J 


0 (d) 


^np{l-p) 


+ e 


-s^/12 


Substituting the value for s, we deduce that 

|E[^(/)]-E[^(f)]|<0 


M\ 


0{d) 


p{\-p) pj ^npil-p) 

Just as before, we can obtain corollaries for the Levy distance. 

Corollary 6.18. Under the setting of Theorem \6.17\ the Levy distance between f{pp) and f is 


□ 


O 


M\ 


0(d) 


.P(l-P) p) ^np{l-p)' 
Proof. The proof is identical to the proof of Corollarv l6.14l 


□ 


Corollary 6.19. Under the setting of Theorem \6.1'T\ for two functions f,g, if the Levy distance between the 
profiles of f and g is e < 1/2, then the Levy distance between fipp) and g{pp) is at most 

r.( d 1 l log{l/e) M\\ 

\pii-p) p) ^np{i-p) yip{i-p)pj ■ 

Proof. As in the proof of Corollary 16. 161 the idea is to bound the Levy distance between f and g. Let cr have 
the distribution in Theorem 16. 171 and let t be a threshold to be determined. The argument of Corollarv l6.161 
shows that 

Pr[f [o’] < t] < Pifeio] < t + B^e] + e, where = O 

Therefore 

Pr[f <f\= E[Pr[f[(T] < t]] < E[Pr[g[cr] < t + B^e]l\„\<T] + Prljo-I > t] + e 

< Pr[g < t + Brc] + Pr[|(T| > r] + e. 
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As in the proof of Lemma 16.51 Hoeffding’s inequality shows that 

Pr[|o-| > r] = Cl(Pr[|5'| > r]) = 

showing that the Levy distance between f and g is at most + e). Choosing t = 

\/ 2 p^i^-p} ’ conclude that the Levy distance between f and g is at most 


O 


/log(l/e)M 


e. 


p{l-p) r] ^ 

The result now follows from Corollary 16.181 and the triangle inequality for the Levy distance. 


□ 


Theorem 16.171 states that we can recover the distribution of a low-degree multilinear polynomial on the 
Boolean cube from its distribution on a few coupled slices. Interpolation in the other direction is not possible: 
the distribution of a low-degree polynomial function on the Boolean cube doesn’t determine its distribution 
on the various slices. For example, consider the following two functions, for even n: 

_ ELi _ Erii(^2i-i - X2^) 

^ \/np(l - p) ’ ^ \/np{l - p) 

The central limit theorem shows that with respect to fip, the distribution of both functions is close to N(0,1). 
However, on the slice Vpn, the first function vanishes, while the second function also has a distribution close 
to N(0,1), due to Theorem 14.61 


7 Functions depending on few coordinates 


In this section we prove an invariance principle for bounded functions depending on o(n) coordinates. In 
contrast to our work so far, the functions in question need not be harmonic multilinear polynomials. The 
invariance principle immediately follows from the following bound on total variation distance. 

Lemma 7.1. Let p{l — p)n —> oo and m = o(p(I — p)n). Denote the projection of Pp and Vpn into the first 
m coordinates by and The total variation distance between p'^ and is o(I). 

Proof. Let k = pn, and consider the ratio p{£) between the probability of a set of size £ under and under 

, iiM = (V-'b/O 

To understand the behavior of p{£), we compute the ratio p[£ + 1)/p{£)-. 


p{£ -fl) I—p k — £ 

p(£) p n — m — k + i+1 

Thus p{£ + I) > p[£) iff (I — p){k — £) > p{n — m — k + £ + 1) iff€< p{m — I). We deduce that the largest 
value of p{£) is obtained for £q = pm (assuming for simplicity that this is indeed an integer). 

At the point £q we can estimate, using Stirling’s approximation, 


p(4) 


/ n—m \ 

^pn) 

p-p(n-m)(^_p)-(l-p)(n-m) ^ / p( 

\/2irp(l-p)n 


n — m 


.gO(l/p(l-p)(n-m)-0(l/p(l-p)n) _ Q 


p{l-p)n 
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Altogether, we deduce that for all p{£) < 1 + 0{m/p{l — p)n). Therefore the total variation distance is 


■ ^ / e-, p(e)>i ^ ^ ' 


□ 


p(i)>i 


As an immediate corollary, we obtain an invariance principle for bounded functions depending on o(n) 
coordinates. 

Theorem 7.2. Let f be a function on {0,1}" depending on o{p{l—p)n) coordinates and satisfying ||/||oo < 1- 
As p{l — p)n —?■ oo, 

I E[/]-E[/]|=o(l). 

^pn fAp 

Proof. Suppose that / depends on m = o(p(l — p)n) coordinates. Applying the triangle inequality shows 
that 

EJ/]-E[/]|< Knix) - f^pix)\\f{x)\ < Knix) - f^pix)\, 

a:G{0,l}'" rcG{0,l}'" 


the last expression being exactly the total variation distance between ly' and p' 


□ 


Corollary 7.3. Let f be a function on {0,1}" depending on o(p(l — p)n) coordinates. As p{\ — p)n —>■ oo, 
the CDF distance between Vpnif) and Pp{f) tends to zero, that is, 

sup I Pr[/ < t] - Pr[/ <t]\ = o(l). 

Mp 


Proof. Consider the functions ft = l/<t: which satisfy ||/t||oo = 1 for all t G R. 


□ 


8 Functions with low total influence 

In this section we prove an invariance principle for Boolean functions whose total influence is o(y^p(l — p)n). 
In other words, we show that Boolean functions with total influence o{^yp{l — p)n) cannot distinguish the 
slice from the cube. 

For us a Boolean function is a function /: {0,1}" —>■ {0,1}, and total influence is defined as follows: 
Inf[/] = p(l — p)nPr[/(a:) ^ /(y)], where x pp and y is a random neighbor of x, chosen uniformly from all 
n neighbors (this differs from the normalization in [17] by a factor oip{l—p)). As in Section [3 the functions 
we consider in this section need not be harmonic multilinear polynomials. 

The upper bound \/p{l — p)n is necessary. Indeed, consider a threshold function with threshold pn. This 
function has total influence Q{y/p{l — p)n), is constant on the slice, and is roughly balanced on the cube. 
The condition that the function is Boolean is also necessary (total influence can be extended to arbitrary 
functions, see for example [29l Section 8.4]). The function (xi + • • • + a:„)/-\/p(l — p)n has unit variance 
and unit total influence on the cube, is constant on the slice, and has a non-trivial distribution on the cube. 
Invariance does hold, however, for non-Boolean harmonic multilinear polynomials of degree o{^p{l — p)n), 
as Theorem 14.61 shows. 

We will use part of the setup of Definition 14.11 namely the random variables X(s) G (fo^) for 0 < s < n. 
We also define two new random variables: s ^ B(n,p) and t ~ B(n — I,p), where B(n,p) is the binomial 
distribution with n trials and success probability p. The basic observation is that X(s) is distributed 
uniformly on the cube {0,1}” with respect to the measure Pp whereas TA(np) is distributed uniformly on the 
slice and so our goal would be to bound the probability Pr[/(X(s)) ^ /(X(np))]. 

The following lemma bounds Pr[/(X(s)) ^ /(X(np))] using a hybrid argument. 
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Lemma 8.1. For every Boolean function f, 


Pr[/(X(s)) + f{X{np))] < 

n—1 np—1 

^ Pr[s > s + 1] Pr[/(X(s)) ^ /(X(s + 1))] + ^ Pr[s < s] Pr[/(X(s)) ^ /(X(s + 1))]. 

s—np s—0 

Proof. For an event E, let |i?] denote the corresponding indicator. Let I denote the interval whose endpoints 
are s and np (inclusive). Then 

I/(X(s)) ^/(X(np))l < ^ I/(X(s))^/(X(s + l))] = 

{s,s+l}CI 

n—1 

^[{s, s + 1} c II . I/(X(s)) ^ f{X{s + 1))1. 

s^O 

Since I is independent of X, taking expectations we get 

n — 1 

Pr[/(X(s)) ^ f{X{np))] < Y, Pr[{s, s + 1} C I] Pr[/(X(s)) ^ f{X{s + 1))] 

s^O 

n—1 

= ^ Pr[s > s + 1] Pr[/(X(s)) + f{X{s + 1))] 

S—np 

np— 1 

+ ^ Pr[s < s] Pr[/(X(s)) + f{X{s + 1))]. □ 

s=0 

We can write the total influence in a very similar form. 

Lemma 8.2. For every Boolean function f, 

n—1 

Inf[/] = ^p(l -p)nPr[t = s] Pr[/(X(s)) f{X{s + 1))]. 

s=0 

Proof. Let (x, y) be a random edge of the cube oriented so that \x\ < |j/|, where \x\ is the Hamming weight of 
X. It is not hard to check that {x,y) ^ (X(t), X(t + 1)). This directly implies the formula given above. □ 

In order to bound Pr[/(X(s)) ^ f{X{np))], it remains to analyze the ratio between the coefficients in 
the two lemmas. 

Lemma 8.3. //p(l — p)n —> oo then for every Boolean function f, 


Pr[/(X(s)) ^ f{X{np))] = O 


\Vpi^-ph 



Proof. In view of Lemma 18.11 and Lemma 18.21 it remains to bound from above 


the following two ratios: 


Pi{s) 

P2{s) 


Pr[s > s + 1] 
p{l — p)nPr[t = s] 
Pr[s < s] 

p{l — p)n Pr[t = s] 


{pn < s < n — 1 ), 
(0 < s <pn — 1 ). 
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It is not hard to check that 

Pr[t > s] Pr[t < s] 

^ “ p(l — p)nPr[t = s] ’ ^ “ p(l — p)nPr[t = s] 

Log-concavity of the binomial coefficients shows that pi is decreasing while p 2 is increasing, and so in the 
given ranges, pi{s) < pi{np) and P 2 (s) < p 2 (np). It is known that the median of the binomial distribution 
B(n — l,p) is one of np — l,np, and so the numerator of both pi{np) and P 2 {np) is 1/2 ± o(l). The local 
limit theorem shows that the common denominator is (1 ± o(l)) Y^p(l — p)n/2'K. Therefore 


Pi(s),P2(s) < 


2 p{l — p)n 


□ 


Our main result in this section immediately follows. 

Theorem 8.4. If p{\ — p)n —>■ oo and f is a Boolean function satisfying Inf[/] = o{i/p(T--pjn) then 

|E^J/]-E.,„[/]|=o(l). 

As a corollary, we prove that balanced symmetric Boolean functions cannot be approximated by Boolean 
functions whose influence is o{^yp{l — p)n). 

Proposition 8.5. There exists a constant d > 0 such that for all p G (0,1) and large enough n, if f is a 
symmetric Boolean function such that i [/] < | then Pr^^ [f d] > ^ for every Boolean function g 

satisfying Inf[g] = o{^yp{l - p)n). 

Our result can be considered a generalization of two recent results (with much weaker bounds): 

• O’Donnell and Wimmer [30] proved much tighter results (in terms of d) in the special case of the 
majority function. 

• Tal [37] proved tighter bounds (in terms of <5) for general symmetric functions assuming g is an AC*^ 
circuit whose influence sum is o(Y^p(l — p)n). 

Proof. It is known that the total variation distance between pp and pq in the regime \p — q\ = o(l) is 
0{\p — q\\J p{i-^ )- Choose C > 0 so that \p — q\ < C^p(l — p)/n implies that the variation distance 
between pp and pq is at most 1/4. Call a q satisfying this bound good. 

Let 5 be a Boolean function satisfying Inf[ 5 ] = o{^p{l — p)n). Suppose that Pr^^p)/ ^ g] < 5. If 
X ^ Pp then \x\/n is good with constant probability, and so there exists a slice with q good such that 
Pr.,„[/ 7 ^ 5 ] = 0 (d). 

Theorem 18.41 implies that |E^_j[ 5 ] —= o(l). On the one hand, — ^„^„[f]\ = 0{S), 

and so the symmetry of / implies that either E^^^[g] = 0{S) or ¥.y^.^[g\ = 1 — 0{5). On the other hand, 
- ^rp[f]\ < I IE;., [5] - E;.p[g]| + |E;.p[5] - E^p[/]| < 1/4 -t S. We conclude that either E^p[/] < 
1/4 -I- 0{6) + 0 ( 1 ) or E^p[/] > 3/4 — 0(i5) — o(l). For an appropriate choice of 6 , this contradicts the 
assumption 1/3 < E^p [/] <2/3 for large enough n. □ 


9 Decomposing the slice 

In this section we describe two ways of decomposing the space of functions over the slice. One decomposition 
arises naturally from the harmonic representation: / = X]d<n /2 PP® corresponding decomposition of 
the space of functions is into the subspaces of homogeneous harmonic multilinear polynomials of degree d, 
for 0 < d < n/2. We call this decompositon the coarse decomposition, and discuss it further in Section l9.ll 
The coarse decomposition can be refined into a distinguished basis of the space of functions over the slice, 
known as the Gelfand-Tsetlin basis, which is described in Section [9.21 
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9.1 Coarse decomposition 

Every harmonic function / can be decomposed as a sum of homogeneous parts: 

/ = E 

d<nj 2 

This decomposition naturally arises from the representation theory of the slice. In this subsection we discuss 
this connection, and obtain a representation-theoretic proof of Theorem 13.91 as a result. 

So far we have viewed the slice as a subset of the Boolean cube. However, it is also possible to view the 
slice as the set of cosets of Sk x Sn-k inside Sn- In other words, we identify a subset A S with 
the set of permutations tt such that 7 r({I,..., k}) = A. From this point of view it is natural to consider the 
action of S'„ on by permutation of the coordinates. The resulting module is isomorphic to the 

permutation module (assuming k < n/2) whose decomposition into irreducible modules (Specht 

modules) can easily be computed to be 

~ ]\j{n-k,k) ^ g(n) ^ ^(n- 1 , 1 ) 0 ... 0 g(n-k,k) 

Since each irreducible representation appears once, {Sn, Sk x Sn-k) forms a Gelfand pair. Classical represen¬ 
tation theory of the symmetric group (for example, [3S1 §2.9-2.10]) allows us to identify with the 

space of all homogeneous harmonic multilinear polynomials of degree d over the slice. 

Consider now the middle slice (uniquely identify every harmonic multilinear polynomial 
on xi,...,Xn with a function over the middle slice. If f,g are harmonic multilinear polynomials and a 
is an exchangeable measure then Ecj[/gr] = f'^Ag, where A commutes with the action of Sn (since a is 
exchangeable). Since A commutes with the action, Schur’s lemma implies that it acts on each irreducible 
g(n-d,d) scalar multiplication (since each irreducible representation appears only once). This implies that 
for some constants Ag,..., A[„/ 2 J depending only on a, 

fAg= Y. Ad(/=^r"), 

d<nj 2 



where f~‘^ is the component of / in (defined uniquely since there is only one copy of this irreducible 

representation) and the inner product is given by {x,y) = x'^y; there are no mixed summands {f^‘^,g^^) 
since the decomposition into irreducible representations is orthogonal. Theorem 13.91 follows by taking, for 
each d < n/2, 

\\{xi - X2) . ■ . {x2d-l - X2d)V 

A similar argument explains why the decomposition into homogeneous parts appears in some of our 
other results such as Lemma [3.121 and Lemma 13.141 More generally, if A is any operator on which 

commutes with the action of Sn then 


Af= Y 

d<m.in(A;,n—fe) 


where Ag,..., Ad are the eigenvalues of A. 

The decomposition into homogeneous parts appears in several other guises: 

Association schemes The operators on K.[(f^l)] which commute with the action of Sn form a commutative 
algebra knows as the Bose-Mesner algebra of the Johnson association scheme. This algebra has two 
important bases: the spatial basis corresponds to the characterization of the algebra as the set of 
matrices A in which A{S,T) depends only on fl T|, and the spectral basis corresponds to the 
decomposition into homogeneous parts. For more on association schemes, see Bannai and Ito [2]. 
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Regular semilattices and spherical posets For each k, we can consider the subset of the n-dimensional 
Boolean cube consisting of all sets of cardinality at most fc as a lattice under the set inclusion order. 
The truncated Boolean lattice is an example of a regular semilattice and, when k < n/2, of a spherical 
lattice. Delsarte m and Stanton [36] gave general constructions which recover the decomposition of 
into its irreducible components. For more, see Ceccherini-Silberstein et al. [HI § 8 .2-8.4]. 

Differential posets We can also consider the entire Boolean cube as a lattice. The Boolean lattice is an 
example of a pL-dijferential poset. Stanley [35] gave a general construction, which in the special case of 
the Boolean lattice decomposes it into the various slices, and each slice into the irreducible components. 
This decomposition corresponds to the Terwilliger algebra of the binary Hamming association scheme. 
For more on this area, see Engel [THl §6.2]. 

9.2 Gelfand—Tsetlin basis 

So far we have discussed the decomposition of each function on the slice into its homogeneous parts. This 
decomposition has a similar counterpart for functions on the Boolean cube, the so-called levels of the Fourier 
expansion. For functions on the Boolean cube, the decomposition can be refined to the Fourier basis. The 
Fourier basis is uniquely defined (up to scalar multiplication) as the set of characters of the group In 
the case of the slice, we can obtain such a canonical basis by specifying an order on the coordinates, say the 
standard order xi,... ,Xn- 

Recall the decomposition into irreducible 5'„-modules of discussed in the preceding subsection: 

j^{n-k,k) ^ g{n) 0 ... 0 g{n-k,k) 

If we consider (the space of homogeneous harmonic multilinear polynomials of degree d) as a module 

over Sn-i rather than over Sn, then it is no longer irreducible. Instead, it decomposes into two irreducibles: 
g{n-i-d,d) 0 ^(ra-d.d-i)^ we then consider each of these irreducibles as modules over Sn -2 they decompose 
even further, and continuing in this way, eventually we get a decomposition into irreducibles of dimension 1 . 
The corresponding basis (defined up to scalar multiplication) is known as the Gelfand-Tsetlin basis. 

Srinivasan [34] described an inductive construction of this basis using ideas from Sperner theory, which 
are related to the theory of differential posets mentioned above. Ambainis et al. [ 1 ] gave an inductive 
construction which closely follows the definition, in the context of quantum computation. Filmus m gave 
an explicit formula for the basis elements, which we now describe. 

A sequence (oi, bi), ..., (a^, bd) is admissible if: 

1 . All 2d numbers are distinct and belong to [n]. 

2 . bi < ■■■ <bd. 

3. Oi < bi for all i. 

A set B — {bi ,..., bd} (where bi <■■■< bd) is admissible if it can be completed to an admissible sequence 
(equivalently, it is the bottom row of a standard Young tableau of shape (n — d,d)). There are (()) — (dli) 
admissible sets of size d. For each admissible set B, we define a basis element 

XB= {Xa,-Xb,)---{Xa^-XbJ. 

ai,...,ad : 

{ai,bi),...,{ad,bd) admissible 



In total there are ()() admissible sets of size at most k, and the corresponding basis elements constitute the 
Gelfand-Tsetlin basis for Furthermore, if a is any exchangeable measure then 
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For proofs, see Filmus m- 

The Gelfand-Tseltlin basis can also be characterized (up to scalar multiplication) as the common eigen¬ 
vectors of the Gelfand-Tsetlin algebra, which is generated by the Young-Jucys-Murphy elements 

Xm = m). 

i<.m 

Filmus [M Lemma 24] gives the following explicit formula: 

m 

. 1=1 

Finally, this basis can be constructed in the same way as Young’s orthogonal basis. For each d, consider 
all standard Young tableaux of shape (n —d, d), arranged in lexicographic order of the second row. With each 
tableau having first row oi,..., an-d and second row bi,... ,bd, associate the function {xa^ — Xb^) ■ ■ ■ (xa^ — 
Xb^). Running the Gram-Schmidt orthogonalization process on these functions yields the degree d part of 
the Gelfand-Tsetlin basis described above. 


XB = 


— \B D [mjKm -|- 1 — |i? fl [to] 


XB- 


10 Why harmonic functions? 

Our invariance principle compares the distribution of harmonic multilinear polynomials on the slice and 
on the Boolean cube (with respect to an appropriate measure). This empirically validates the following 
informal claim: The correct way to lift a function from the slice to the Boolean cube is through its unique 
representation as a bounded degree harmonic multilinear polynomial. 

In the same way, the classical invariance principle of Mossel et al. m suggests that the correct way to 
lift a function from the Boolean cube to Gaussian space it through its unique representation as a multilinear 
polynomial. Alternatively, given a function on the Boolean cube, we lift it to a function on Gaussian space 
by linear interpolation in all axis-parallel directions. 

In this section we discuss two other justifications for using harmonic multilinear polynomials, from com¬ 
mutative algebra and from representation theory. We briefly mention that James’ intersecting kernel theo¬ 
rem [20] (see also 0 §10.9] and [TOl §6.2]) draws together both points of view, and generalizes them from 
the slice to the multislice^ in which the object of study is an arbitrary permutation module of the symmetric 
group. We hope to explore this connection in future work. 

10.1 Commutative algebra 

Let us consider first the case of the Boolean cube { — 1,1}". The classical invariance principle [17] compares 
the distribution of multilinear polynomials over the Boolean cube and over Gaussian space. Multilinear 
polynomials arise naturally when we consider { — 1,1}" as a variety defined by the ideal I = (x^ —1,..., —1). 

The coordinate ring over the variety consists of polynomials over xi,...,x„ modulo the ideal I. It is 
immediately clear that every polynomial is equivalent to a multilinear polynomial modulo /, and a dimension 
computation shows that this representation is unique. 

We can treat the case of the slice in a similar way. The slice is a variety defined by the ideal 

n 

4 = (a:i - xi,..., x^ - Xn, Xj - fc). 

i=l 

Theorem 13.51 shows that the coordinate ring is isomorphic to the space of harmonic multilinear polynomials 
of degree at most min(fc, n — k). Intuitively speaking, the existence of xf — Xi in the ideal allows us to reduce 
every polynomial to a multilinear polynomial; the existence of X]r=i Xi — k corresponds to the harmonicity 
constraint; and the degree constraint follows from the fact that xq .. -Xq^^ = 0 (when k < njT). 

We can formalize this intuition to give another proof of Theorem 13.51 which uses arguments due to 
Blekherman [5] quoted in Lee et al. [23]. Similar arguments can be found in Engel [15] §6.2]. 
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Theorem 10.1 (Reformulation of Theorem 13.51) . Let 0 < k < n. For every polynomial P over xi,... ,Xn 
there exists a unique harmonic multilinear polynomial Q of degree at most min(fc, n — k) such that P = Q 
(mod Ik). 

Proof. We start by showing that every polynomial is equivalent modulo Ik to a harmonic multilinear poly¬ 
nomial of degree at most k. We will use the following definitions: 

• Given S' C [n], xs = Hies 

• We denote by Pd the linear space of all homogeneous multilinear polynomials of degree d, and by 
Hd C Pd its subspace consisting of harmonic polynomials. 

The first step is replacing x[ (for r > 1) by x^ using the equation xf = Xi, which holds modulo Ik. We 
are left with a multilinear polynomial. 

The second step is removing all monomials of degree larger than min(A:, n—k). If fc < n/2, then monomials 
of degree larger than k always evaluate to zero on the slicqj. If k > n/2, then we get a similar simplification 
by replacing Xi with 1 — (1 — Xi), and using the fact that the product of more than n — k different factors 
1 — Xi always evaluates to zero on the slice. We are left with a multilinear polynomial of degree at most 
min(fc, n — k). 

Definition 13.11 defines a differential operator A = X]r=i When applied to multilinear polynomials, we 
can also think of it as a formal operator which maps Xs to s-iid extends linearly to multilinear 

polynomials (this is the Lefschetz lowering operator in the Boolean lattice). When restricted to Pd, the range 
of A is Pd-i. Linear algebra shows that Pd decomposes into ker A and imA^, where A^ (the Lefschetz 
raising operator) is the operator on P^-i which maps xs to ^Suli}- Modulo Ik, 


A'^xs= ^ xsu{i}=xs ^ Xi = {k-\S\)xs = {k-d+l)xs, 


iG [n]\S 


2e[n]\S 


the third equality following logically since xs ^ 0 only if Xi = I for all i € S, and formally using xsXi = xs 
for all i G S. Thus im A^ can be identified with Pd-i. We conclude that any polynomial in Pd can be written 
(modulo Ik) as the sum of a polynomial in Hd (belonging to ker A) and a polynomial in Pd-i (belonging to 
im A^). 

Applying this construction recursively, we see that every homogeneous multilinear polynomial is equiv¬ 
alent modulo Ik to a harmonic multilinear polynomial, and moreover this operation doesn’t increase the 
degree. This completes the proof that every polynomial is equivalent modulo Ik to a harmonic multilinear 
polynomial of degree at most min(fc, n — k). 

It remains to show that this representation is unique. The calculation above shows that A^, considered 
as an operator from Pd-i to Pd for d < n/2, has full rank dimP^-i. The decomposition ker A © im A^ of 
Pd shows that when d > 0, Hd has dimension dim ker A = (//') — dimimA^ = (”) — {//i). When d = 0, the 
dimension is clearly I. It follows that the space of all harmonic multilinear polynomials of degree at most 


min(fc, n — k) has dimension 1 + 
functions on the slice. 


(d) ~ (d-i) “ (fc)’ matching the dimension of the space of 


□ 


Using very similar ideas, we can prove Blekherman’s theorem stated in Section [ 6 . II 

Theorem 10.2 (Reformulation of Theorem 16.11) . Every multilinear polynomial f over Xi,... ,x„ of degree 
at most d < n /2 can be represented uniquely in the form 


d 

f{xi,...,Xn) = '^fi{xi,...,Xn)S" (mod I), 

where I = {x\ — xi,..., x^ — x„), S' = xi + • • • + x„, and fi is a harmonic multilinear polynomial of degree 
at most d — i. 

®Formally speaking, this step requires us to work with \/7fc. However, the ideal Ik is radical, see for example |32l Lemma 
6 . 1 ], 
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Proof. Throughout the proof we use the definitions of xs,Pd, Hd, A, A'^ in the proof of Theorem 110.11 We 
also define P<d to be the space of all multilinear polynomials of degree at most d, and P[<d to be the space 
of all harmonic multilinear polynomials of degree at most d. 

We start by considering the case in which / is a homogeneous multilinear polynomial. We prove by 
induction on e that if deg / = e then for some /o,..., /e, 

e 

/ = ^ fiS"- (mod /), where fi € 

2=0 


When e = 0 this is clear, since / is already harmonic, so suppose that e > 0. Recall that A maps Pe to Pe-i- 
The conjugate operator A^ maps xa G Pe-i to 


i^A 



= {S - {e- l))a;^ 


(mod I). 


Since Pe = ker A © im A^ = Hd © im A^, it follows that f = h + {S — {e — l))g (mod I) for some h € He 
and g € Pe-i- By the induction hypothesis, for some go, ■ ■ ■ ,ge-i, 


e-l 

g = '^giS^ (mod /), where gi G H<e-i-i- 

i =0 

Substituting this in the equation for /, we obtain 

e— 1 e 

f = hp - (e - l))g^S^ = [h-{e- l)go] + (mod I). 

i—0 i—1 

Since deg[/i— (e— l)go] < max(e, e— 1) = e and deg[ 5 i_i — (e— l)gi] < max((e— 1) — {i — 1), e — 1 — i) = e — i, 
we have obtained the required representation. 

When / is an arbitrary multilinear polynomial of degree d, we get the required representation by summing 
the representations of /=°,..., f^‘^. 

Finally, we prove that the representation is unique by comparing dimensions: 




2=0 


2=0 


d 

dimH<i = dim 
2=0 


since H<e has dimension (") for e < n/2 by Theorem 13.51 □ 

This implies a surprising corollary for harmonic projections. 

Definition 10.3. Let / be a function over xi,..., Xn- Its harmonic projection on the slice is the unique 
harmonic multilinear polynomial of degree at most min(fc, n — k) which agrees with / on the slice. 

Corollary 10.4. Let f be a multilinear polynomial over xi,... ,Xn of degree d, and let be its harmonic 
projection on the slice where d < k < n — d. For k in this range, F^‘^ does not depend on k. 

Proof. Let fo,.. ■, fd be the Blekherman decomopsition of /, where ft is a harmonic multilinear polynomial 
of degree at most d — i. The function thus a harmonic multilinear polynomial which agrees with 

/ on When d < k < n — d, this polynomial has degree at most d < mm{k,n — k), and so it is the 

harmonic projection Fk of / on the slice Notice that F^'^ = f^'^ does not depend on k. □ 
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10.2 Representation theory 

As we have explained in Subsection 19.11 representation theory naturally leads to the decomposition of each 
function on the slice to its homogeneous parts. From the point of view of the representation theory of the 
symmetric group, it is natural to order the irredicuble representations in increasing order of complexity: 
S'!"), ..., This suggests defining the degree of a function / on the slice as the maximal d 

such that ^ 0; note that this definition doesn’t make use of the harmonic multilinear representation of 
/• 

Another definition which leads to the same parameter is the junta degree, which is the minimal d such that 
/ can be written as a linear combination of functions depending on at most d coordinates. The cometric 
property f Corollary I3.18p . which (apart from a few degenerate cases) only holds for this ordering of the 
irreducible representations, gives further credence to the notion of degree. 

A similar notion of degree exists also for functions on the Boolean cube: the degree of a function on 
{0,1}" is both its degree as a multilinear polynomial and its junta degree. 

The following result shows that the lifting defined by the harmonic multilinear representation is the 
unique way to couple both notions of degree. 

Theorem 10.5. Let 0 < k < nj^. Suppose L is a linear operator from to R[{0,1}"] such that 

(a) For every function f over Lf is an extension of f, that is, {Lf){x) = f{x) for every x G 

fbj For every function f over degLf < deg/. 

(c) For every function f over and for every permutation tt G Sn, L{f'^) = [Lf)^. In other words, L 

commutes with the action of Sn ■ 

Then Lf is given by the unique harmonic multilinear polynomial of degree at most k which agrees with f on 

Proof. In view of Lemma [3.61 it suffices to show that Lf = / for / = {xi — X2) ■ ■ ■ (x2d-i — X2d)- Suppose 
that Lf = g, where g is a multilinear polynomial of degree at most d. Since = —/, also g^^ = —g. If 

we write 

g = XiX 2 r + xis + X 2 t + u, 

where r, s, t, u don’t involve xi, X 2 , then g^^ = xiX 2 r + xit + X 2 S + u, showing that r = u = 0 and s = —t. 

In other words, g = (xi — X 2 )s is a multiple of Xi — X 2 . Similarly g is a multiple of 0:3 — X 4 ,..., X 2 d-i — X 2 d- 
Unique factorization forces g = C{xi — X 2 ) ■ ■ ■ {x 2 d-i — X 2 d) = Cf for some constant C. Since g extends /, 
the constant must be C = 1. □ 
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